1 Introduction



1.1 Intended Audience

This book presents a single semester course on Newtonian dynamics that is intended 
primarily for upper-division (i.e., junior and senior) undergraduate students majoring in 
physics. A thorough understanding of physics at the lower-division level, including a basic 
working knowledge of the laws of mechanics, is assumed. It is also taken for granted that 
students are familiar with the fundamentals of integral and differential calculus, complex 
analysis, ordinary differential equations, and linear algebra. On the other hand, vector 
analysis plays such a central role in the study of Newtonian dynamics that a brief, but 
fairly comprehensive, review of this subject area is provided in Appendix A. Likewise, 
those results in matrix eigenvalue theory which are helpful in the analysis of rigid body 
motion and coupled oscillations are discussed in Sections 8.5 and 11.4, respectively. Fi- 
nally, the calculus of variations, an area of mathematics that is central to Hamiltonian 
dynamics, is outlined in Section 10.2. 

1.2 Scope of Book 

The scope of this book is indicated by its title, "Newtonian Dynamics". "Dynamics" is 
the study of the motions of the various objects in the world around us. Furthermore, by 
"Newtonian", we understand that the theory which we are actually going to employ in our 
investigation of dynamics is that which was first formulated by Sir Isaac Newton in 1687. 

For the sake of simplicity, and brevity, we shall restrict our investigations to the motions 
of idealized point particles and idealized rigid bodies. To be more exact, we shall exclude 
from consideration any discussion of statics, the strength of materials, and the non-rigid 
motions of continuous media. We shall also concentrate, for the most part, on motions 
which take place under the influence of conservative forces, such as gravity, which can be 
accurately represented in terms of simple mathematical formulae. Finally, with one major 
exception, we shall only consider that subset of dynamical problems that can be solved by 
means of conventional mathematical analysis. 

Newtonian dynamics was originally developed in order to predict the motions of the 
objects which make up the Solar System. It turns out that this is an ideal application of the 
theory, since the objects in question can be modeled as being rigid to a fair degree of accu- 
racy, and the motions take place under the action of a single conservative force — namely, 
gravity — that has a simple mathematical form. In particular, the frictional forces which 
greatly complicate the application of Newtonian dynamics to the motions of everyday ob- 
jects close to the Earth's surface are completely absent. Consequently, in this book we shall 
make a particular effort to describe how Newtonian dynamics can successfully account for 
a wide variety of different solar system phenomena. For example, during the course of this
book, we shall explain the origins of Kepler's laws of planetary motion (see Chapter 5),
the rotational flattening of the Earth, the tides, the Roche radius (i.e., the minimum radius 
at which a moon can orbit a planet without being destroyed by tidal forces), the forced 
precession and nutation of the Earth's axis of rotation, and the forced perihelion preces- 
sion of the planets (see Chapter 12). We shall also derive the Tisserand criterion used 
to re-identify comets whose orbits have been modified by close encounters with massive 
planets, account for the existence of the so-called Trojan asteroids which share the orbit of 
Jupiter (see Chapter 13), and analyze the motion of the Moon (see Chapter 14). 

Virtually all of the results described in this book were first obtained — either by Newton 
himself, or by scientists living in the 150, or so, years immediately following the initial 
publication of his theory — by means of conventional mathematical analysis. Indeed, sci- 
entists at the beginning of the 20th century generally assumed that they knew everything 
that there was to know about Newtonian dynamics. However, they were mistaken. The
advent of fast electronic computers, in the latter half of the 20th century, allowed scien- 
tists to solve nonlinear equations of motion, for the first time, via numerical techniques. 
In general, such equations are insoluble using standard analytic methods. The numeri- 
cal investigation of dynamical systems with nonlinear equations of motion revealed the 
existence of a previously unknown type of motion known as deterministic chaos. Such 
motion is quasi-random (despite being derived from deterministic equations of motion), 
aperiodic, and exhibits extreme sensitivity to initial conditions. The discovery of chaotic 
motion led to a renaissance in the study of Newtonian dynamics which started in the late
20th century and is still ongoing. It is therefore appropriate that the last chapter in this 
book is devoted to an in-depth numerical investigation of a particular dynamical system 
that exhibits chaotic motion (see Chapter 15). 

1.3 Major Sources 

The material appearing in Appendix A is largely based on the author's recollections of a 
vector analysis course given by Dr. Stephen Gull at the University of Cambridge. Major 
sources for the material appearing in Chapters 2-13 include Dynamics, H. Lamb, 2nd Edi- 
tion (Cambridge University Press, Cambridge UK, 1923); Analytical Mechanics, G.R. Fowles 
(Holt, Rinehart, and Winston, New York NY, 1977); Solar System Dynamics, C.D. Murray,
and S.F. Dermott (Cambridge University Press, Cambridge UK, 1999); Classical Mechanics,
3rd Edition, H. Goldstein, C. Poole, and J. Safko (Addison-Wesley, San Francisco
CA, 2002); Classical Dynamics of Particles and Systems, 5th Edition, S.T. Thornton, and 
J.B. Marion (Brooks/Cole — Thomson Learning, Belmont CA, 2004); and Analytical Me- 
chanics, 7th Edition, G.R. Fowles, and G.L. Cassiday (Brooks/Cole — Thomson Learning, 
Belmont CA, 2005). The various sources for the material appearing in Chapters 14 and 15 
are identified in footnotes.

2 Newton's Laws of Motion



2.1 Introduction 

This chapter discusses Newton's laws of motion. 

2.2 Newtonian Dynamics 

Newtonian dynamics is a mathematical model whose purpose is to predict the motions of 
the various objects that we encounter in the world around us. The general principles of this 
model were first enunciated by Sir Isaac Newton in a work entitled Philosophiae Naturalis 
Principia Mathematica (Mathematical Principles of Natural Philosophy). This work, which 
was first published in 1687, is nowadays more commonly referred to as the Principia. 1

Up until the beginning of the 20th century, Newton's theory of motion was thought 
to constitute a complete description of all types of motion occurring in the Universe. We 
now know that this is not the case. The modern view is that Newton's theory is only an 
approximation which is valid under certain circumstances. The theory breaks down when 
the velocities of the objects under investigation approach the speed of light in vacuum, and 
must be modified in accordance with Einstein's special theory of relativity. The theory also 
fails in regions of space which are sufficiently curved that the propositions of Euclidean ge- 
ometry do not hold to a good approximation, and must be augmented by Einstein's general 
theory of relativity. Finally, the theory breaks down on atomic and subatomic length-scales,
and must be replaced by quantum mechanics. In this book, we shall neglect relativistic and
quantum effects altogether. It follows that we must restrict our investigations to the
motions of large (compared to an atom) slow (compared to the speed of light) objects moving
in Euclidean space. Fortunately, virtually all of the motions which we commonly observe 
in the world around us fall into this category. 

Newton very deliberately modeled his approach in the Principia on that taken in Euclid's 
Elements. Indeed, Newton's theory of motion has much in common with a conventional 
axiomatic system such as Euclidean geometry. Like all such systems, Newtonian dynamics 
starts from a set of terms that are undefined within the system. In this case, the fundamen- 
tal terms are mass, position, time, and force. It is taken for granted that we understand 
what these terms mean, and, furthermore, that they correspond to measurable quantities 
which can be ascribed to, or associated with, objects in the world around us. In partic- 
ular, it is assumed that the ideas of position in space, distance in space, and position as 
a function of time in space, are correctly described by the Euclidean vector algebra and
vector calculus discussed in Appendix A. The next component of an axiomatic system is
a set of axioms. These are a set of unproven propositions, involving the undefined terms,
from which all other propositions in the system can be derived via logic and mathematical
analysis. In the present case, the axioms are called Newton's laws of motion, and can only
be justified via experimental observation. Note, incidentally, that Newton's laws, in their
primitive form, are only applicable to point objects. However, these laws can be applied to
extended objects by treating them as collections of point objects.

1 An excellent discussion of the historical development of Newtonian dynamics, as well as the physical
and philosophical assumptions which underpin this theory, is given in The Discovery of Dynamics: A Study
from a Machian Point of View of the Discovery and the Structure of Dynamical Theories, J.B. Barbour (Oxford
University Press, Oxford UK, 2001).

One difference between an axiomatic system and a physical theory is that, in the latter 
case, even if a given prediction has been shown to follow necessarily from the axioms of 
the theory, it is still incumbent upon us to test the prediction against experimental obser- 
vations. Lack of agreement might indicate faulty experimental data, faulty application of 
the theory (for instance, in the case of Newtonian dynamics, there might be forces at work 
which we have not identified), or, as a last resort, incorrectness of the theory. Fortunately, 
Newtonian dynamics has been found to give predictions which are in excellent agreement 
with experimental observations in all situations in which it is expected to be valid. 

In the following, it is assumed that we know how to set up a rigid Cartesian frame of 
reference, and how to measure the positions of point objects as functions of time within 
that frame. It is also taken for granted that we have some basic familiarity with the laws of 
mechanics, and standard mathematics up to, and including, calculus, as well as the vector 
analysis outlined in Appendix A. 

2.3 Newton's Laws of Motion 

Newton's laws of motion, in the rather obscure language of the Principia, take the follow- 
ing form: 

1. Every body continues in its state of rest, or uniform motion in a straight-line, unless 
compelled to change that state by forces impressed upon it. 

2. The change of motion (i.e., momentum) of an object is proportional to the force 
impressed upon it, and is made in the direction of the straight-line in which the force 
is impressed. 

3. To every action there is always opposed an equal reaction; or, the mutual actions of 
two bodies upon each other are always equal and directed to contrary parts. 

Let us now examine how these laws can be applied to dynamical systems. 

2.4 Newton's First Law of Motion 

Newton's first law of motion essentially states that a point object subject to zero net ex- 
ternal force moves in a straight-line with a constant speed (i.e., it does not accelerate). 
However, this is only true in special frames of reference called inertial frames. Indeed, we
can think of Newton's first law as the definition of an inertial frame: i.e., an inertial frame
of reference is one in which a point object subject to zero net external force moves in a
straight-line with constant speed.

Figure 2.1: A Galilean coordinate transformation.

Suppose that we have found an inertial frame of reference. Let us set up a Cartesian 
coordinate system in this frame. The motion of a point object can now be specified by 
giving its position vector, r = (x, y, z), with respect to the origin of the coordinate system, 
as a function of time, t. Consider a second frame of reference moving with some constant 
velocity u with respect to the first frame. Without loss of generality, we can suppose that 
the Cartesian axes in the second frame are parallel to the corresponding axes in the first 
frame, that u = (u, 0, 0), and, finally, that the origins of the two frames instantaneously
coincide at t = 0 (see Figure 2.1). Suppose that the position vector of our point object is
r' = (x', y', z') in the second frame of reference. It is evident, from Figure 2.1, that at any
given time, t, the coordinates of the object in the two reference frames satisfy

    x' = x - u t,    (2.1)
    y' = y,    (2.2)
    z' = z.    (2.3)

This coordinate transformation was first discovered by Galileo Galilei, and is known as a
Galilean transformation in his honor.

By definition, the instantaneous velocity of the object in our first reference frame is
given by v = dr/dt = (dx/dt, dy/dt, dz/dt), with an analogous expression for the velocity,
v', in the second frame. It follows from differentiation of Equations (2.1)-(2.3) with
respect to time that the velocity components in the two frames satisfy

    v_x' = v_x - u,    (2.4)
    v_y' = v_y,    (2.5)
    v_z' = v_z.    (2.6)

These equations can be written more succinctly as

    \mathbf{v}' = \mathbf{v} - \mathbf{u}.    (2.7)



Finally, by definition, the instantaneous acceleration of the object in our first reference
frame is given by a = dv/dt = (dv_x/dt, dv_y/dt, dv_z/dt), with an analogous expression for
the acceleration, a', in the second frame. It follows from differentiation of Equations (2.4)-
(2.6) with respect to time that the acceleration components in the two frames satisfy

    \frac{dv_x'}{dt} = \frac{dv_x}{dt},    (2.8)
    \frac{dv_y'}{dt} = \frac{dv_y}{dt},    (2.9)
    \frac{dv_z'}{dt} = \frac{dv_z}{dt}.    (2.10)

These equations can be written more succinctly as

    \mathbf{a}' = \mathbf{a}.    (2.11)

According to Equations (2.7) and (2.11), if an object is moving in a straight-line with
constant speed in our original inertial frame (i.e., if a = 0) then it also moves in a
(different) straight-line with (a different) constant speed in the second frame of reference
(i.e., a' = 0). Hence, we conclude that the second frame of reference is also an inertial frame.

A simple extension of the above argument allows us to conclude that there are an
infinite number of different inertial frames moving with constant velocities with respect to
one another. Newton thought that one of these inertial frames was special, and defined
an absolute standard of rest: i.e., a static object in this frame was in a state of absolute
rest. However, Einstein showed that this is not the case. In fact, there is no absolute
standard of rest: i.e., all motion is relative (hence the name "relativity" for Einstein's
theory). Consequently, one inertial frame is just as good as another as far as Newtonian
dynamics is concerned.

But, what happens if the second frame of reference accelerates with respect to the first?
In this case, it is not hard to see that Equation (2.11) generalizes to

    \mathbf{a}' = \mathbf{a} - \frac{d\mathbf{u}}{dt},    (2.12)

where u(t) is the instantaneous velocity of the second frame with respect to the first.
According to the above formula, if an object is moving in a straight-line with constant
speed in the first frame (i.e., if a = 0) then it does not move in a straight-line with constant
speed in the second frame (i.e., a' ≠ 0). Hence, if the first frame is an inertial frame then
the second is not.

A simple extension of the above argument allows us to conclude that any frame of
reference which accelerates with respect to a given inertial frame is not itself an inertial
frame.

For most practical purposes, when studying the motions of objects close to the Earth's
surface, a reference frame which is fixed with respect to this surface is approximately
inertial. However, if the trajectory of a projectile within such a frame is measured to
high precision then it will be found to deviate slightly from the predictions of Newtonian 
dynamics — see Chapter 7. This deviation is due to the fact that the Earth is rotating, and its 
surface is therefore accelerating towards its axis of rotation. When studying the motions of 
objects in orbit around the Earth, a reference frame whose origin is the center of the Earth, 
and whose coordinate axes are fixed with respect to distant stars, is approximately inertial. 
However, if such orbits are measured to extremely high precision then they will again be 
found to deviate very slightly from the predictions of Newtonian dynamics. In this case, 
the deviation is due to the Earth's orbital motion about the Sun. When studying the orbits 
of the planets in the Solar System, a reference frame whose origin is the center of the Sun, 
and whose coordinate axes are fixed with respect to distant stars, is approximately inertial. 
In this case, any deviations of the orbits from the predictions of Newtonian dynamics due to 
the orbital motion of the Sun about the galactic center are far too small to be measurable. 
It should be noted that it is impossible to identify an absolute inertial frame — the best 
approximation to such a frame would be one in which the cosmic microwave background 
appears to be (approximately) isotropic. However, for a given dynamical problem, it is 
always possible to identify an approximate inertial frame. Furthermore, any deviations 
of such a frame from a true inertial frame can be incorporated into the framework of 
Newtonian dynamics via the introduction of so-called fictitious forces — see Chapter 7. 
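
As a rough numerical illustration of the frame transformations discussed in this section, the following Python sketch applies the Galilean transformation to a test trajectory and estimates the acceleration in each frame by finite differences. The trajectory, the frame speed u, the frame acceleration alpha, and all function names are assumptions chosen purely for illustration. The sketch suggests that the acceleration is unchanged in a uniformly moving frame, whereas a frame whose velocity grows as u(t) = alpha t sees the acceleration shifted by -du/dt.

```python
# Sketch: Galilean transformation of a trajectory, with a finite-difference
# check of how the acceleration transforms between frames.

def galilean(r, t, u):
    """Map r = (x, y, z) at time t into a frame moving with speed u along x."""
    x, y, z = r
    return (x - u * t, y, z)

def trajectory(t):
    """Hypothetical test trajectory: a projectile launched from the origin."""
    g = 9.81  # assumed uniform gravitational acceleration, m/s^2
    return (3.0 * t, 0.0, 4.0 * t - 0.5 * g * t * t)

def acceleration(path, t, h=1e-3):
    """Central second difference of a path r(t), component by component."""
    before, now, after = path(t - h), path(t), path(t + h)
    return tuple((a - 2.0 * b + c) / (h * h) for a, b, c in zip(before, now, after))

u = 5.0      # constant relative speed of the second frame (assumed value)
alpha = 2.0  # rate of change of u for the accelerating frame (assumed value)

def moving_frame_path(t):
    """Trajectory seen from the uniformly moving frame."""
    return galilean(trajectory(t), t, u)

def accelerating_frame_path(t):
    """Trajectory seen from a frame whose origin has moved alpha * t**2 / 2 along x."""
    x, y, z = trajectory(t)
    return (x - 0.5 * alpha * t * t, y, z)

a_first = acceleration(trajectory, 1.0)
a_second = acceleration(moving_frame_path, 1.0)
a_accel = acceleration(accelerating_frame_path, 1.0)
print("first frame:       ", a_first)
print("uniformly moving:  ", a_second)
print("accelerating frame:", a_accel)
```

The first two results should agree to within rounding error, while the x-component of the third differs from the first by alpha.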



2.5 Newton's Second Law of Motion 



Newton's second law of motion essentially states that if a point object is subject to an
external force, f, then its equation of motion is given by

    \frac{d\mathbf{p}}{dt} = \mathbf{f},    (2.13)

where the momentum, p, is the product of the object's inertial mass, m, and its velocity, v.
If m is not a function of time then the above expression reduces to the familiar equation

    m \frac{d\mathbf{v}}{dt} = \mathbf{f}.    (2.14)

Note that this equation is only valid in an inertial frame. Clearly, the inertial mass of an
object measures its reluctance to deviate from its preferred state of uniform motion in a
straight-line (in an inertial frame). Of course, the above equation of motion can only be
solved if we have an independent expression for the force, f (i.e., a law of force). Let us
suppose that this is the case.

An important corollary of Newton's second law is that force is a vector quantity. This 
must be the case, since the law equates force to the product of a scalar (mass) and a vector 
(acceleration). Note that acceleration is obviously a vector because it is directly related 
to displacement, which is the prototype of all vectors — see Appendix A. One consequence 
of force being a vector is that two forces, f_1 and f_2, both acting at a given point, have the
same effect as a single force, f = f_1 + f_2, acting at the same point, where the summation
is performed according to the laws of vector addition (see Appendix A). Likewise, a single
force, f, acting at a given point, has the same effect as two forces, f_1 and f_2, acting at the
same point, provided that f_1 + f_2 = f. This method of combining and splitting forces is
known as the resolution of forces, and lies at the heart of many calculations in Newtonian 
dynamics. 
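
The resolution of forces described above can be sketched in a few lines of Python. The example below is only an illustration: the helper names (add, dot, resolve), the mass, the slope angle, and the value of g are all assumptions. It splits the weight of an object resting on a 30 degree incline into a part along the slope and a part perpendicular to it, and then recombines the two parts by vector addition.

```python
import math

def add(f1, f2):
    """Vector sum of two forces acting at the same point."""
    return tuple(a + b for a, b in zip(f1, f2))

def dot(a, b):
    """Scalar product of two vectors."""
    return sum(p * q for p, q in zip(a, b))

def resolve(f, direction):
    """Split f into parts parallel and perpendicular to a chosen direction."""
    norm = math.sqrt(dot(direction, direction))
    unit = tuple(d / norm for d in direction)
    along = dot(f, unit)
    parallel = tuple(along * e for e in unit)
    perpendicular = tuple(a - p for a, p in zip(f, parallel))
    return parallel, perpendicular

mass, g = 2.0, 9.81              # assumed mass (kg) and gravitational acceleration (m/s^2)
weight = (0.0, -mass * g)        # weight in (horizontal, vertical) components
slope = math.radians(30.0)       # assumed inclination of the slope
down_slope, into_slope = resolve(weight, (math.cos(slope), math.sin(slope)))

print("along slope:        ", down_slope)
print("perpendicular part: ", into_slope)
print("recombined:         ", add(down_slope, into_slope))
```

The recombined vector should reproduce the original weight, and the two parts should be mutually perpendicular.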

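Taking the scalar product of Equation (2.14) with the velocity, v, we obtain

    m \mathbf{v} \cdot \frac{d\mathbf{v}}{dt} = \frac{m}{2} \frac{d(\mathbf{v} \cdot \mathbf{v})}{dt} = \frac{m}{2} \frac{dv^2}{dt} = \mathbf{f} \cdot \mathbf{v}.    (2.15)

This can be written

    \frac{dK}{dt} = \mathbf{f} \cdot \mathbf{v},    (2.16)

where

    K = \frac{1}{2} m v^2.    (2.17)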

The right-hand side of Equation (2.16) represents the rate at which the force does work 
on the object: i.e., the rate at which the force transfers energy to the object. The quantity 
K represents the energy the object possesses by virtue of its motion. This type of energy 
is generally known as kinetic energy. Thus, Equation (2.16) states that any work done on
a point object by an external force goes to increase the object's kinetic energy.

Suppose that, under the action of the force, f, our object moves from point P at time
t_1 to point Q at time t_2. The net change in the object's kinetic energy is obtained by
integrating Equation (2.16):

    \Delta K = \int_{t_1}^{t_2} \mathbf{f} \cdot \mathbf{v} \, dt = \int_P^Q \mathbf{f} \cdot d\mathbf{r},    (2.18)

since v = dr/dt. Here, dr is an element of the object's path between points P and Q, and
the integral in r represents the net work done by the force as the object moves along the
path from P to Q.
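
The statement that the net work done equals the change in kinetic energy can be checked numerically. The following Python sketch is a minimal illustration under stated assumptions: the force law (a linear spring plus linear drag), the mass, the initial conditions, and the use of a fourth-order Runge-Kutta integrator are all choices made here for convenience, not taken from the text. The accumulated work W, defined by dW/dt = f · v, is integrated alongside the equation of motion and compared with the change in K.

```python
# Sketch: integrate m dv/dt = f together with dW/dt = f . v, then compare
# the accumulated work W with the change in kinetic energy.

def force(r, v):
    """Hypothetical force: linear spring plus linear drag (non-conservative)."""
    k, c = 4.0, 0.3
    return [-k * ri - c * vi for ri, vi in zip(r, v)]

def derivative(state, m):
    """Time derivative of state = [x, y, vx, vy, W]."""
    r, v = state[0:2], state[2:4]
    f = force(r, v)
    power = sum(fi * vi for fi, vi in zip(f, v))   # rate of working, f . v
    return v + [fi / m for fi in f] + [power]

def rk4_step(state, m, dt):
    """One classical fourth-order Runge-Kutta step."""
    def shifted(k, scale):
        return [s + scale * ki for s, ki in zip(state, k)]
    k1 = derivative(state, m)
    k2 = derivative(shifted(k1, 0.5 * dt), m)
    k3 = derivative(shifted(k2, 0.5 * dt), m)
    k4 = derivative(shifted(k3, dt), m)
    return [s + dt * (a + 2.0 * b + 2.0 * c + d) / 6.0
            for s, a, b, c, d in zip(state, k1, k2, k3, k4)]

def kinetic_energy(state, m):
    return 0.5 * m * (state[2] ** 2 + state[3] ** 2)

m = 2.0
state = [1.0, 0.0, 0.0, 1.5, 0.0]   # x, y, vx, vy, accumulated work
k_start = kinetic_energy(state, m)
for _ in range(2000):
    state = rk4_step(state, m, 1e-3)

delta_k = kinetic_energy(state, m) - k_start
work = state[4]
print("change in K:", delta_k)
print("work done:  ", work)
```

The two printed numbers should agree closely, even though the drag term makes the force non-conservative, because Equation (2.18) holds for any force.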

As described in Section A.15, there are basically two kinds of forces in nature. Firstly,
those for which line integrals of the type \int_P^Q \mathbf{f} \cdot d\mathbf{r} depend on the end points, but not on
the path taken between these points. Secondly, those for which line integrals of the type
\int_P^Q \mathbf{f} \cdot d\mathbf{r} depend both on the end points, and the path taken between these points. The first
kind of force is termed conservative, whereas the second kind is termed non-conservative. It
was also demonstrated in Section A.15 that if the line integral \int_P^Q \mathbf{f} \cdot d\mathbf{r} is path independent
then the force f can always be written as the gradient of a scalar field. In other words, all
conservative forces satisfy

    \mathbf{f} = -\nabla U,    (2.19)

for some scalar field U(r). Note that

    \int_P^Q \nabla U \cdot d\mathbf{r} = \Delta U = U(Q) - U(P),    (2.20)

irrespective of the path taken between P and Q. Hence, it follows from Equation (2.18)
that

    \Delta K = -\Delta U    (2.21)

for conservative forces. Another way of writing this is

    E = K + U = constant.    (2.22)

Of course, we recognize this as an energy conservation equation: E is the object's total 
energy, which is conserved; K is the energy the object has by virtue of its motion, otherwise
known as its kinetic energy; and U is the energy the object has by virtue of its position,
otherwise known as its potential energy. Note, however, that we can only write such energy 
conservation equations for conservative forces. Gravity is a good example of a conservative 
force. Non-conservative forces, on the other hand, do not conserve energy. In general, this 
is because of some sort of frictional energy loss which drains energy from the dynamical 
system whilst it remains in motion. Note that potential energy is only defined up to an arbitrary
additive constant. In fact, it is only the difference in potential energy between different
points in space that is well-defined.
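
The distinction between conservative and non-conservative forces drawn in this section can be explored numerically. The Python sketch below is an illustration only: the potential U = (x^2 + y^2)/2, the "swirling" force (-y, x), the two paths, and the midpoint-rule integrator are all assumptions made here. It evaluates the work integral of each force along two different paths joining the same end points, P = (1, 0) and Q = (0, 2).

```python
# Sketch: path (in)dependence of the work integral for two planar forces.

def line_integral(force, path, steps=2000):
    """Midpoint-rule estimate of the work done by force along path(s), 0 <= s <= 1."""
    total = 0.0
    for n in range(steps):
        x0, y0 = path(n / steps)
        x1, y1 = path((n + 1) / steps)
        fx, fy = force(0.5 * (x0 + x1), 0.5 * (y0 + y1))
        total += fx * (x1 - x0) + fy * (y1 - y0)
    return total

def potential(x, y):
    """Assumed potential energy U(r)."""
    return 0.5 * (x * x + y * y)

def conservative(x, y):
    """f = -grad U for the potential above."""
    return (-x, -y)

def swirling(x, y):
    """Hypothetical force that cannot be written as the gradient of a scalar."""
    return (-y, x)

def straight(s):
    """Straight line from P = (1, 0) to Q = (0, 2)."""
    return (1.0 - s, 2.0 * s)

def dog_leg(s):
    """From P to Q via the corner (1, 2)."""
    return (1.0, 4.0 * s) if s < 0.5 else (2.0 - 2.0 * s, 2.0)

minus_delta_u = -(potential(0.0, 2.0) - potential(1.0, 0.0))
for name, f in (("conservative", conservative), ("swirling", swirling)):
    print(name, line_integral(f, straight), line_integral(f, dog_leg))
print("-(U(Q) - U(P)) =", minus_delta_u)
```

For the conservative force both paths should give the same work, equal to -(U(Q) - U(P)), whereas the swirling force gives different values along the two paths.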



2.6 Newton's Third Law of Motion 

Consider a system of N mutually interacting point objects. Let the ith object, whose mass
is m_i, be located at vector displacement r_i. Suppose that this object exerts a force f_ji on the
jth object. Likewise, suppose that the jth object exerts a force f_ij on the ith object.
Newton's third law of motion essentially states that these two forces are equal and opposite,
irrespective of their nature. In other words,

    \mathbf{f}_{ij} = -\mathbf{f}_{ji}.    (2.23)

One corollary of Newton's third law is that an object cannot exert a force on itself. An- 
other corollary is that all forces in the Universe have corresponding reactions. The only 
exceptions to this rule are the fictitious forces which arise in non-inertial reference frames 
(e.g., the centrifugal and Coriolis forces which appear in rotating reference frames — see 
Chapter 7). Fictitious forces do not possess reactions. 

It should be noted that Newton's third law implies action at a distance. In other words, 
if the force that object i exerts on object j suddenly changes then Newton's third law de- 
mands that there must be an immediate change in the force that object j exerts on object 
i. Moreover, this must be true irrespective of the distance between the two objects. How- 
ever, we now know that Einstein's theory of relativity forbids information from traveling 
through the Universe faster than the velocity of light in vacuum. Hence, action at a dis- 
tance is also forbidden. In other words, if the force that object i exerts on object j suddenly 
changes then there must be a time delay, which is at least as long as it takes a light ray 
to propagate between the two objects, before the force that object j exerts on object i can
respond. Of course, this means that Newton's third law is not, strictly speaking, correct.
However, as long as we restrict our investigations to the motions of dynamical systems
on time-scales that are long compared to the time required for light-rays to traverse these 
systems, Newton's third law can be regarded as being approximately correct. 

In an inertial frame, Newton's second law of motion applied to the ith object yields

    m_i \frac{d^2\mathbf{r}_i}{dt^2} = \sum_{j=1,N}^{j \neq i} \mathbf{f}_{ij}.    (2.24)

Note that the summation on the right-hand side of the above equation excludes the case
j = i, since the ith object cannot exert a force on itself. Let us now take the above equation
and sum it over all objects. We obtain

    \sum_{i=1,N} m_i \frac{d^2\mathbf{r}_i}{dt^2} = \sum_{i,j=1,N}^{j \neq i} \mathbf{f}_{ij}.    (2.25)

Consider the sum over forces on the right-hand side of the above equation. Each element
of this sum (f_ij, say) can be paired with another element (f_ji, in this case) which is equal
and opposite, according to Newton's third law. In other words, the elements of the sum
all cancel out in pairs. Thus, the net value of the sum is zero. It follows that the above
equation can be written

    M \frac{d^2\mathbf{r}_{cm}}{dt^2} = \mathbf{0},    (2.26)

where M = \sum_{i=1,N} m_i is the total mass. The quantity r_cm is the vector displacement of the
center of mass of the system, which is an imaginary point whose coordinates are the mass
weighted averages of the coordinates of the objects that constitute the system: i.e.,

    \mathbf{r}_{cm} = \frac{\sum_{i=1,N} m_i \mathbf{r}_i}{\sum_{i=1,N} m_i}.    (2.27)

According to Equation (2.26), the center of mass of the system moves in a uniform straight-
line, in accordance with Newton's first law of motion, irrespective of the nature of the
forces acting between the various components of the system.

Now, if the center of mass moves in a uniform straight-line then the center of mass
velocity,

    \frac{d\mathbf{r}_{cm}}{dt} = \frac{\sum_{i=1,N} m_i \, d\mathbf{r}_i/dt}{\sum_{i=1,N} m_i},    (2.28)

is a constant of the motion. However, the momentum of the ith object takes the form
p_i = m_i dr_i/dt. Hence, the total momentum of the system is written

    \mathbf{P} = \sum_{i=1,N} m_i \frac{d\mathbf{r}_i}{dt}.    (2.29)

A comparison of Equations (2.28) and (2.29) suggests that P is also a constant of the
motion. In other words, the total momentum of the system is a conserved quantity, irrespective
of the nature of the forces acting between the various components of the system. This re- 
sult (which only holds if there is no net external force acting on the system) is a direct 
consequence of Newton's third law of motion. 
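
The conservation of total momentum, and the uniform motion of the center of mass, can be illustrated with a small simulation. The Python sketch below makes several assumptions that are not part of the text: three objects moving in a plane, a linear attractive pair force, arbitrary initial conditions, and a simple explicit Euler time step. The only property that matters for the argument is that the pair forces satisfy Equation (2.23).

```python
# Sketch: total momentum and center-of-mass motion for an isolated system
# whose pair forces obey Newton's third law.

masses = [1.0, 2.0, 3.0]   # assumed masses of the three objects

def pair_force(ri, rj, k=1.5):
    """Force on object i due to object j: a hypothetical linear attraction."""
    return [-k * (a - b) for a, b in zip(ri, rj)]

def total_momentum(velocities):
    return [sum(m * v[c] for m, v in zip(masses, velocities)) for c in range(2)]

def center_of_mass(positions):
    total = sum(masses)
    return [sum(m * r[c] for m, r in zip(masses, positions)) / total for c in range(2)]

def euler_step(positions, velocities, dt):
    """Advance every object by one explicit Euler step."""
    forces = []
    for i, ri in enumerate(positions):
        net = [0.0, 0.0]
        for j, rj in enumerate(positions):
            if j != i:                      # an object exerts no force on itself
                f = pair_force(ri, rj)
                net = [net[0] + f[0], net[1] + f[1]]
        forces.append(net)
    new_positions = [[r[c] + v[c] * dt for c in range(2)]
                     for r, v in zip(positions, velocities)]
    new_velocities = [[v[c] + f[c] / m * dt for c in range(2)]
                      for v, f, m in zip(velocities, forces, masses)]
    return new_positions, new_velocities

positions = [[0.0, 0.0], [1.0, 0.0], [0.0, 2.0]]
velocities = [[0.5, 0.0], [0.0, -0.2], [0.1, 0.3]]
p_start, cm_start = total_momentum(velocities), center_of_mass(positions)

dt, steps = 1e-3, 1000
for _ in range(steps):
    positions, velocities = euler_step(positions, velocities, dt)
p_end, cm_end = total_momentum(velocities), center_of_mass(positions)

print("momentum before and after:", p_start, p_end)
print("center-of-mass displacement:", [b - a for a, b in zip(cm_start, cm_end)])
```

Because the internal forces cancel in pairs at every step, the total momentum should be unchanged to within rounding error, and the center of mass should be displaced by P t / M.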

Taking the vector product of Equation (2.24) with the position vector r_i, we obtain

    m_i \mathbf{r}_i \times \frac{d^2\mathbf{r}_i}{dt^2} = \sum_{j=1,N}^{j \neq i} \mathbf{r}_i \times \mathbf{f}_{ij}.    (2.30)

However, it is easily seen that

    m_i \mathbf{r}_i \times \frac{d^2\mathbf{r}_i}{dt^2} = \frac{d}{dt}\left( m_i \mathbf{r}_i \times \frac{d\mathbf{r}_i}{dt} \right) = \frac{d\mathbf{l}_i}{dt},    (2.31)

where

    \mathbf{l}_i = m_i \mathbf{r}_i \times \frac{d\mathbf{r}_i}{dt}    (2.32)

is the angular momentum of the ith particle about the origin of our coordinate system. The
total angular momentum of the system (about the origin) takes the form

    \mathbf{L} = \sum_{i=1,N} \mathbf{l}_i.    (2.33)

Hence, summing Equation (2.30) over all particles, we obtain

    \frac{d\mathbf{L}}{dt} = \sum_{i,j=1,N}^{i \neq j} \mathbf{r}_i \times \mathbf{f}_{ij}.    (2.34)

Consider the sum on the right-hand side of the above equation. A general term, r_i × f_ij,
in this sum can always be paired with a matching term, r_j × f_ji, in which the indices have
been swapped. Making use of Equation (2.23), the sum of a general matched pair can be
written

    \mathbf{r}_i \times \mathbf{f}_{ij} + \mathbf{r}_j \times \mathbf{f}_{ji} = (\mathbf{r}_i - \mathbf{r}_j) \times \mathbf{f}_{ij}.    (2.35)

Let us assume that the forces acting between the various components of the system are
central in nature, so that f_ij is parallel to r_i - r_j. In other words, the force exerted on
object j by object i either points directly toward, or directly away from, object i, and vice
versa. This is a reasonable assumption, since most of the forces which we encounter in the
world around us are of this type (e.g., gravity). It follows that if the forces are central in
nature then the vector product on the right-hand side of the above expression is zero. We
conclude that

    \mathbf{r}_i \times \mathbf{f}_{ij} + \mathbf{r}_j \times \mathbf{f}_{ji} = \mathbf{0},    (2.36)

for all values of i and j. Thus, the sum on the right-hand side of Equation (2.34) is zero
for any kind of central force. We are left with

    \frac{d\mathbf{L}}{dt} = \mathbf{0}.    (2.37)

In other words, the total angular momentum of the system is a conserved quantity, provided
that the different components of the system interact via central forces (and there is no net
external torque acting on the system).
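
The role played by the central-force assumption can be seen in a short numerical check. The Python sketch below is purely illustrative: the particle positions, the inverse-square attraction, and the "sideways" force are assumptions made here. Both force laws obey Newton's third law, but only the first is directed along r_i - r_j. The sketch sums the internal torques r_i × f_ij over all pairs, working in the x-y plane so that only the z-component of each vector product is needed.

```python
import math

def cross_z(a, b):
    """z-component of the vector product of two vectors lying in the x-y plane."""
    return a[0] * b[1] - a[1] * b[0]

def central_force(ri, rj):
    """Force on i due to j, directed along r_i - r_j (inverse-square attraction)."""
    dx, dy = ri[0] - rj[0], ri[1] - rj[1]
    d3 = math.hypot(dx, dy) ** 3
    return (-dx / d3, -dy / d3)

def sideways_force(ri, rj):
    """Hypothetical force obeying f_ij = -f_ji but perpendicular to r_i - r_j."""
    dx, dy = ri[0] - rj[0], ri[1] - rj[1]
    return (-dy, dx)

def net_internal_torque(positions, force):
    """Sum of r_i x f_ij over all ordered pairs with j != i."""
    total = 0.0
    for i, ri in enumerate(positions):
        for j, rj in enumerate(positions):
            if j != i:
                total += cross_z(ri, force(ri, rj))
    return total

positions = [(0.0, 0.0), (1.0, 0.5), (-0.3, 2.0)]   # assumed particle positions
torque_central = net_internal_torque(positions, central_force)
torque_sideways = net_internal_torque(positions, sideways_force)
print("central forces:    ", torque_central)
print("non-central forces:", torque_sideways)
```

The first sum should vanish to within rounding error, whereas the second does not, which is why the conservation of angular momentum requires the internal forces to be central and not merely equal and opposite.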

2.7 Non-Isolated Systems 

Up to now, we have only considered isolated dynamical systems, in which all of the forces 
acting on the system originate within the system itself. Let us now generalize our approach 
to deal with non-isolated dynamical systems, in which some of the forces originate outside 
the system. Consider a system of N mutually interacting point objects. Let m_i and r_i be
the mass and position vector of the ith object, respectively. Suppose that the ith object is
subject to two forces: first, an internal force which originates from the other objects in the
system, and second, an external force which originates outside the system. In other words,
let the force acting on the ith object take the form

    \mathbf{f}_i = \sum_{j=1,N}^{j \neq i} \mathbf{f}_{ij} + \mathbf{F}_i,    (2.38)

where f_ij is the internal force exerted by object j on object i, and F_i the net external force
acting on object i.

The equation of motion of the ith object is

    m_i \frac{d^2\mathbf{r}_i}{dt^2} = \sum_{j=1,N}^{j \neq i} \mathbf{f}_{ij} + \mathbf{F}_i.    (2.39)

Summing over all objects, we obtain

    \sum_{i=1,N} m_i \frac{d^2\mathbf{r}_i}{dt^2} = \sum_{i,j=1,N}^{j \neq i} \mathbf{f}_{ij} + \sum_{i=1,N} \mathbf{F}_i,    (2.40)

which reduces to

    \frac{d\mathbf{P}}{dt} = \mathbf{F},    (2.41)

where

    \mathbf{F} = \sum_{i=1,N} \mathbf{F}_i    (2.42)

is the net external force acting on the system. Here, the sum over the internal forces has
cancelled out in pairs due to Newton's third law of motion. We conclude that the total 
system momentum evolves in time according to the simple equation (2.41) when there 
is a net external force acting on the system, but is completely unaffected by the internal 
forces. The fact that Equation (2.41) is similar in form to Equation (2.13) suggests that 
the center of mass of a system of many point objects has analogous dynamics to a point 
object. 
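
The claim that the center of mass behaves like a single point object can be tested on a simple non-isolated system. In the Python sketch below, two objects joined by a spring move vertically under uniform gravity. The spring constant, rest length, masses, initial conditions, value of g, and the velocity-Verlet integrator are all assumptions made for this illustration. The simulated center of mass is compared with the parabola followed by a point object of mass M acted on by the net external force.

```python
# Sketch: two objects coupled by a spring, falling under uniform gravity.
# The center of mass should follow the trajectory of a single point object.

G_ACCEL = 9.81   # assumed uniform gravitational acceleration, m/s^2

def forces(z, masses, k=50.0, rest_length=1.0):
    """Total force on each object: internal spring force plus external weight."""
    stretch = (z[1] - z[0]) - rest_length
    internal = k * stretch                      # force on object 0 due to object 1
    return [internal - masses[0] * G_ACCEL, -internal - masses[1] * G_ACCEL]

def center_of_mass(values, masses):
    """Mass-weighted average of a list of coordinates (or velocities)."""
    return sum(m * v for m, v in zip(masses, values)) / sum(masses)

def simulate(z, v, masses, dt, steps):
    """Velocity-Verlet integration of the two equations of motion."""
    z, v = list(z), list(v)
    f = forces(z, masses)
    for _ in range(steps):
        z = [zi + vi * dt + 0.5 * fi / m * dt * dt
             for zi, vi, fi, m in zip(z, v, f, masses)]
        f_new = forces(z, masses)
        v = [vi + 0.5 * (fi + fn) / m * dt
             for vi, fi, fn, m in zip(v, f, f_new, masses)]
        f = f_new
    return z, v

masses = [1.0, 3.0]
z0, v0 = [0.0, 1.4], [2.0, -1.0]
dt, steps = 1e-3, 1000
z_end, v_end = simulate(z0, v0, masses, dt, steps)

t = dt * steps
z_cm_simulated = center_of_mass(z_end, masses)
z_cm_predicted = (center_of_mass(z0, masses) + center_of_mass(v0, masses) * t
                  - 0.5 * G_ACCEL * t * t)
print("simulated center of mass:", z_cm_simulated)
print("point-object prediction: ", z_cm_predicted)
```

The internal spring force changes the separation of the two objects but should leave the motion of the center of mass unaffected.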

Taking r_i × Equation (2.39), and summing over all objects, we obtain

    \frac{d\mathbf{L}}{dt} = \mathbf{T},    (2.43)

where

    \mathbf{T} = \sum_{i=1,N} \mathbf{r}_i \times \mathbf{F}_i    (2.44)

is the net external torque acting on the system. Here, the sum over the internal torques 
has cancelled out in pairs, assuming that the internal forces are central in nature. We 
conclude that the total system angular momentum evolves in time according to the simple 
equation (2.43) when there is a net external torque acting on the system, but is completely 
unaffected by the internal torques. 
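
Equation (2.43) can be checked in the simplest possible case of a single projectile, for which there are no internal torques at all. The Python sketch below assumes motion in the x-z plane under a uniform weight, with hypothetical launch conditions, and compares a finite-difference estimate of the rate of change of angular momentum about the origin with the torque exerted by the weight.

```python
# Sketch: dL/dt = T for one projectile moving in the x-z plane under its weight.

M, G = 2.0, 9.81                      # assumed mass and gravitational acceleration
X0, Z0, VX, VZ = 1.0, 0.5, 3.0, 4.0   # hypothetical launch conditions

def position(t):
    return (X0 + VX * t, Z0 + VZ * t - 0.5 * G * t * t)

def velocity(t):
    return (VX, VZ - G * t)

def angular_momentum(t):
    """Component of m r x v normal to the plane of motion (the y-component)."""
    (x, z), (vx, vz) = position(t), velocity(t)
    return M * (z * vx - x * vz)

def torque(t):
    """Same component of r x F, where F is the weight."""
    x, z = position(t)
    fx, fz = 0.0, -M * G
    return z * fx - x * fz

def rate_of_change(func, t, h=1e-4):
    """Central finite-difference estimate of d(func)/dt."""
    return (func(t + h) - func(t - h)) / (2.0 * h)

t_check = 0.7
print("dL/dt:", rate_of_change(angular_momentum, t_check))
print("T:    ", torque(t_check))
```

The two printed values should agree to within the accuracy of the finite-difference estimate.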

2.8 Exercises 

2.1. Consider an isolated system of N point objects interacting via gravity. Let the mass and 
position vector of the ith object be m_i and r_i, respectively. What is the vector equation
of motion of the ith object? Write expressions for the total kinetic energy, K, and potential
energy, U, of the system. Demonstrate from the equations of motion that K + U is a conserved 
quantity. 

2.2. Consider a function of many variables f(x_1, x_2, ..., x_n). Such a function which satisfies

    f(t x_1, t x_2, \ldots, t x_n) = t^a f(x_1, x_2, \ldots, x_n)

for all t > 0, and all values of the x_i, is termed a homogeneous function of degree a. Prove the
following theorem regarding homogeneous functions:

    \sum_{i=1,n} x_i \frac{\partial f}{\partial x_i} = a f.

2.3. Consider an isolated system of N point objects interacting via attractive central forces. Let the
mass and position vector of the ith object be $m_i$ and $\mathbf{r}_i$, respectively. Suppose that the magnitude
of the force exerted on object i by object j is $k_i\,k_j\,|\mathbf{r}_i - \mathbf{r}_j|^{-n}$. Here, the $k_i$ measure some
constant physical property of the particles (e.g., their electric charges). Write an expression
for the total potential energy U of the system. Is this a homogeneous function? If so, what is
its degree? Write the equation of motion of the ith particle. Use the mathematical theorem
from the previous exercise to demonstrate that

$$\frac{1}{2}\frac{d^2 I}{dt^2} = 2K + (n-1)\,U,$$

where $I = \sum_{i=1,N} m_i\,r_i^2$, and K is the kinetic energy. This result is known as the virial
theorem. Demonstrate that there are no bound steady-state equilibria for the system (i.e.,
states in which the global system parameters do not evolve in time) when n > 3.

2.4. A star can be thought of as a spherical system that consists of a very large number of particles
interacting via gravity. Show that, for such a system, the virial theorem, introduced in the
previous exercise, implies that

$$\frac{d^2 I}{dt^2} = -2\,U + c,$$

where c is a constant, and the $r_i$ are measured from the geometric center. Hence, deduce that
the angular frequency of small amplitude radial pulsations of the star (in which the radial
displacement is directly proportional to the radial distance from the center) takes the form

$$\omega = \left(\frac{|U_0|}{I_0}\right)^{1/2},$$

where $U_0$ and $I_0$ are the equilibrium values of U and I. Finally, show that if the mass density
within the star varies as $r^{-a}$, where r is the radial distance from the geometric center, and
where a < 5/2, then

$$\omega = \left(\frac{5-a}{5-2a}\,\frac{G\,M}{R^3}\right)^{1/2},$$

where M and R are the stellar mass and radius, respectively.

2.5. Consider a system of N point particles. Let the ith particle have mass $m_i$, electric charge
$q_i$, and position vector $\mathbf{r}_i$. Suppose that the charge to mass ratio, $q_i/m_i$, is the same for all
particles. The system is placed in a uniform magnetic field $\mathbf{B}$. Write the equation of motion of
the ith particle. You may neglect any magnetic fields generated by the motion of the particles.
Demonstrate that the total momentum $\mathbf{P}$ of the system precesses about $\mathbf{B}$ at the frequency
$\Omega = q_i\,B/m_i$. Demonstrate that $L_\parallel + \Omega\,I_\parallel/2$ is a constant of the motion. Here, $L_\parallel$ is the
total angular momentum of the system parallel to the magnetic field, and $I_\parallel$ is the moment
of inertia of the system about an axis parallel to $\mathbf{B}$ which passes through the origin.



3 One-Dimensional Motion



3.1 Introduction 

This chapter employs Newton's laws to investigate one-dimensional motion. Particular 
attention is given to the various mathematical techniques commonly used to analyze oscil- 
latory motion. 



3.2 Motion in a General One-Dimensional Potential 

Consider a point particle of mass m moving in the x-direction, say, under the action of 
some x-directed force f(x). Suppose that f(x) is a conservative force: e.g., gravity. In this 
case, according to Equation (2.19), we can write 

$$f(x) = -\frac{dU}{dx}, \tag{3.1}$$

where U(x) is the potential energy of the particle at position x. 

Let the curve U(x) take the form shown in Figure 3.1. For instance, this curve might 
represent the gravitational potential energy of a cyclist freewheeling in a hilly region. 
Observe that we have set the potential energy at infinity to zero (which we are generally
free to do, since potential energy is defined only up to an arbitrary additive constant). This is a
fairly common convention. What can we deduce about the motion of the particle in this
potential?

Well, we know that the total energy, E, which is the sum of the kinetic energy, K, and
the potential energy, U, is a constant of the motion [see Equation (2.22)]. Hence, we can
write

$$K(x) = E - U(x). \tag{3.2}$$

However, we also know that a kinetic energy can never be negative [since $K = (1/2)\,m\,v^2$,
and neither m nor $v^2$ can be negative]. Hence, the above expression tells us that the
particle's motion is restricted to the region (or regions) in which the potential energy curve
U(x) falls below the value E. This idea is illustrated in Figure 3.1. Suppose that the total
energy of the system is $E_0$. It is clear, from the figure, that the particle is trapped inside one
or other of the two dips in the potential (these dips are generally referred to as potential
wells). Suppose that we now raise the energy to $E_1$. In this case, the particle is free to enter
or leave each of the potential wells, but its motion is still bounded to some extent, since it
clearly cannot move off to infinity. Finally, let us raise the energy to $E_2$. Now the particle is
unbounded: i.e., it can move off to infinity. In conservative systems in which it makes sense
to adopt the convention that the potential energy at infinity is zero, bounded systems are
characterized by E < 0, whereas unbounded systems are characterized by E > 0.

Figure 3.1: A potential energy curve. (Marked positions: $x_0$, $x_1$, $x_2$.)



The above discussion suggests that the motion of a particle moving in a potential
generally becomes less bounded as the total energy E of the system increases. Conversely,
we would expect the motion to become more bounded as E decreases. In fact, if the energy
becomes sufficiently small then it appears likely that the system will settle down in some
equilibrium state in which the particle is stationary. Let us try to identify any prospective
equilibrium states in Figure 3.1. If the particle remains stationary then it must be subject
to zero force (otherwise it would accelerate). Hence, according to Equation (3.1), an
equilibrium state is characterized by

$$\frac{dU}{dx} = 0. \tag{3.3}$$

In other words, an equilibrium state corresponds to either a maximum or a minimum of the
potential energy curve U(x). It can be seen that the U(x) curve shown in Figure 3.1 has
three associated equilibrium states located at $x = x_0$, $x = x_1$, and $x = x_2$.

Let us now make a distinction between stable equilibrium points and unstable equilibrium
points. When the particle is slightly displaced from a stable equilibrium point then
the resultant force f acting on it must always be such as to return it to this point. In other
words, if $x = x_0$ is an equilibrium point then we require

$$\left.\frac{df}{dx}\right|_{x_0} < 0 \tag{3.4}$$

for stability: i.e., if the particle is displaced to the right, so that $x - x_0 > 0$, then the force
must act to the left, so that f < 0, and vice versa. Likewise, if

$$\left.\frac{df}{dx}\right|_{x_0} > 0 \tag{3.5}$$

then the equilibrium point $x = x_0$ is unstable. It follows, from Equation (3.1), that stable
equilibrium points are characterized by

$$\frac{d^2 U}{dx^2} > 0. \tag{3.6}$$



In other words, a stable equilibrium point corresponds to a minimum of the potential
energy curve U(x). Likewise, an unstable equilibrium point corresponds to a maximum
of the U(x) curve. Hence, we conclude that, in Figure 3.1, $x = x_0$ and $x = x_2$ are stable
equilibrium points, whereas $x = x_1$ is an unstable equilibrium point. Of course, this makes
perfect sense if we think of U(x) as a gravitational potential energy curve, so that U is
directly proportional to height. In this case, all we are saying is that it is easy to confine a
low energy mass at the bottom of a valley, but very difficult to balance the same mass on
the top of a hill (since any slight displacement of the mass will cause it to fall down the
hill). Note, finally, that if

$$\frac{dU}{dx} = \frac{d^2 U}{dx^2} = 0 \tag{3.7}$$

at any point (or in any region) then we have what is known as a neutral equilibrium
point. We can move the particle slightly away from such a point and it will still remain
in equilibrium (i.e., it will neither attempt to return to its initial state, nor will it continue
to move). A neutral equilibrium point corresponds to a flat spot in a U(x) curve. See
Figure 3.2.
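
The classification of equilibrium points described above can be sketched numerically. The following Python fragment is only an illustration: the function name classify_equilibrium, the finite-difference step h, the tolerance tol, and the double-well potential $U(x) = x^4 - 2x^2$ are all assumptions chosen for the example, not quantities taken from the text.

```python
def classify_equilibrium(U, x, h=1e-4, tol=1e-6):
    """Classify the point x of the potential U using central finite differences.

    dU/dx = 0 marks an equilibrium; the sign of d^2U/dx^2 then decides
    between stable (minimum), unstable (maximum), and neutral (flat spot).
    """
    dU = (U(x + h) - U(x - h)) / (2.0 * h)
    d2U = (U(x + h) - 2.0 * U(x) + U(x - h)) / (h * h)
    if abs(dU) > tol:
        return "not an equilibrium"
    if d2U > tol:
        return "stable"
    if d2U < -tol:
        return "unstable"
    return "neutral"


# Hypothetical double-well potential (an assumption made for this sketch).
def U(x):
    return x**4 - 2.0 * x**2


for point in (-1.0, 0.0, 1.0, 0.5):
    print(point, classify_equilibrium(U, point))
```

For this sample potential the two wells at x = -1 and x = +1 play the role of the stable points in Figure 3.1, and the hump at x = 0 plays the role of the unstable point.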

The equation of motion of a particle moving in one dimension under the action of a
conservative force is, in principle, integrable. Since $K = (1/2)\,m\,v^2$, the energy conservation
equation (3.2) can be rearranged to give

$$v = \pm \left(\frac{2\,[E - U(x)]}{m}\right)^{1/2}, \tag{3.8}$$

where the ± signs correspond to motion to the right and to the left, respectively. However,
since v = dx/dt, this expression can be integrated to give

$$t = \pm \left(\frac{m}{2E}\right)^{1/2} \int_{x_0}^{x} \frac{dx'}{\sqrt{1 - U(x')/E}}, \tag{3.9}$$

where $x(t = 0) = x_0$. For sufficiently simple potential functions, U(x), the above equation
can be solved to give x as a function of t. For instance, if $U = (1/2)\,k\,x^2$, $x_0 = 0$, and the
plus sign is chosen, then

$$t = \left(\frac{m}{k}\right)^{1/2} \int_0^{(k/2E)^{1/2}\,x} \frac{dy}{\sqrt{1 - y^2}} = \left(\frac{m}{k}\right)^{1/2} \sin^{-1}\!\left(\left[\frac{k}{2E}\right]^{1/2} x\right), \tag{3.10}$$

which can be inverted to give

$$x = a \sin(\omega\,t), \tag{3.11}$$

where $a = \sqrt{2E/k}$ and $\omega = \sqrt{k/m}$. Note that the particle reverses direction each time it
reaches one of the so-called turning points ($x = \pm a$) at which U = E (and, so K = 0).
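
As a rough check on Equation (3.9), the integral can be evaluated numerically for the harmonic potential and compared with the closed-form result (3.11). The sketch below uses a simple midpoint rule; the function name travel_time and the parameter values m = 1, k = 4, E = 2 are assumptions made purely for illustration.

```python
import math


def travel_time(U, E, m, x_start, x_end, steps=20000):
    """Midpoint-rule estimate of Equation (3.9) with the plus sign:
    t = (m/2E)^(1/2) * integral from x_start to x_end of dx' / sqrt(1 - U(x')/E).
    """
    h = (x_end - x_start) / steps
    total = 0.0
    for i in range(steps):
        x_mid = x_start + (i + 0.5) * h  # midpoint of the i-th sub-interval
        total += h / math.sqrt(1.0 - U(x_mid) / E)
    return math.sqrt(m / (2.0 * E)) * total


# Illustrative parameter values (assumptions): these give a = 1 and omega = 2.
m, k, E = 1.0, 4.0, 2.0
a = math.sqrt(2.0 * E / k)
omega = math.sqrt(k / m)

x = 0.5 * a  # stay well away from the turning point, where the integrand diverges
t_numeric = travel_time(lambda s: 0.5 * k * s * s, E, m, 0.0, x)
t_exact = math.asin(x / a) / omega  # obtained by inverting x = a sin(omega t)
print(t_numeric, t_exact)
```

The integrand is singular at the turning points, so a more careful quadrature would be needed to follow the particle all the way to $x = \pm a$.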



3.3 Velocity Dependent Forces 



Consider a particle of mass m moving in one dimension under the action of a force, f, 
which is a function of the particle's speed, v, but not of its displacement, x. Note that 
such a force is intrinsically non-conservative [since it clearly cannot be expressed as minus 
the gradient of some potential function, U(x)]. Now, the particle's equation of motion is 
written 

$$m\,\frac{dv}{dt} = f(v). \tag{3.12}$$

Integrating this equation, we obtain

$$\int_{v_0}^{v} \frac{dv'}{f(v')} = \frac{t}{m}, \tag{3.13}$$

where $v(t = 0) = v_0$. In principle, the above equation can be solved to give v(t). The
equation of motion is also written



$$m\,v\,\frac{dv}{dx} = f(v), \tag{3.14}$$

since v = dx/dt. Integrating this equation, we obtain

$$\int_{v_0}^{v} \frac{v'\,dv'}{f(v')} = \frac{x - x_0}{m}, \tag{3.15}$$

where $x(t = 0) = x_0$. In principle, the above equation can be solved to give v(x).

Let us now consider a specific example. Suppose that an object of mass m falls vertically
under gravity. Let x be the height through which the object has fallen since t = 0, at which
time the object is assumed to be at rest. It follows that $x_0 = v_0 = 0$. Suppose that, in
addition to the force of gravity, m g, where g is the gravitational acceleration, our object
is subject to a retarding air resistance force which is proportional to the square of its
instantaneous velocity. The object's equation of motion is thus



$$m\,\frac{dv}{dt} = m\,g - c\,v^2, \tag{3.16}$$

where c > 0. This equation can be integrated to give

$$\int_0^{v} \frac{dv'}{1 - (v'/v_t)^2} = g\,t, \tag{3.17}$$

where $v_t = (m\,g/c)^{1/2}$. Making a change of variable, we obtain

$$\int_0^{v/v_t} \frac{dy}{1 - y^2} = \frac{g}{v_t}\,t. \tag{3.18}$$

The left-hand side of the above equation is now a standard integral, which can be solved
to give

$$\tanh^{-1}\!\left(\frac{v}{v_t}\right) = \frac{g\,t}{v_t}, \tag{3.19}$$

or

$$v = v_t \tanh\!\left(\frac{g\,t}{v_t}\right). \tag{3.20}$$

Thus, when $t \ll v_t/g$, we obtain the standard result $v \simeq g\,t$, since $\tanh x \simeq x$ for $x \ll 1$.
However, when $t \gg v_t/g$, we get $v \simeq v_t$, since $\tanh x \simeq 1$ for $x \gg 1$. It follows that air
resistance prevents the downward velocity of our object from increasing indefinitely as it
falls. Instead, at large times, the velocity asymptotically approaches the so-called terminal
velocity, $v_t$ (at which the gravitational and air resistance forces balance).
The equation of motion of our falling object is also written

$$m\,v\,\frac{dv}{dx} = m\,g - c\,v^2. \tag{3.21}$$

This equation can be integrated to give

$$\int_0^{v} \frac{v'\,dv'}{1 - (v'/v_t)^2} = g\,x. \tag{3.22}$$

Making a change of variable, we obtain

$$\int_0^{(v/v_t)^2} \frac{dy}{1 - y} = \frac{x}{x_t}, \tag{3.23}$$

where $x_t = m/(2\,c)$. The left-hand side of the above equation is now a standard integral,
which can be solved to give

$$-\ln\!\left[1 - \left(\frac{v}{v_t}\right)^2\right] = \frac{x}{x_t}, \tag{3.24}$$

or

$$v = v_t \left(1 - e^{-x/x_t}\right)^{1/2}. \tag{3.25}$$

It follows that our object needs to fall a distance of order $x_t$ before it achieves its terminal
velocity.
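
The closed-form result (3.20) can be compared against a direct numerical integration of the equation of motion (3.16). The sketch below uses a classical fourth-order Runge-Kutta scheme; the function name fall_velocity_rk4 and the values g = 9.81 and c/m = 0.01 (in SI units) are assumptions chosen only to illustrate the approach to terminal velocity.

```python
import math


def fall_velocity_rk4(g, c_over_m, t_end, steps=2000):
    """Integrate dv/dt = g - (c/m) v^2 from v(0) = 0 using classical RK4."""
    def accel(v):
        return g - c_over_m * v * v

    h = t_end / steps
    v = 0.0
    for _ in range(steps):
        k1 = accel(v)
        k2 = accel(v + 0.5 * h * k1)
        k3 = accel(v + 0.5 * h * k2)
        k4 = accel(v + h * k3)
        v += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return v


# Illustrative values (assumptions, not taken from the text).
g = 9.81
c_over_m = 0.01
v_t = math.sqrt(g / c_over_m)           # terminal velocity, (m g / c)^(1/2)

t = 5.0
v_numeric = fall_velocity_rk4(g, c_over_m, t)
v_exact = v_t * math.tanh(g * t / v_t)  # Equation (3.20)
print(v_numeric, v_exact, v_t)
```

For times much longer than $v_t/g$ both values should sit just below $v_t$, in line with the discussion above.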

Incidentally, it is quite easy to account for an air resistance force which scales as the
square of projectile velocity. Let us imagine that our projectile is moving sufficiently rapidly
that air does not have enough time to flow around it, and is instead simply knocked out
of the way. If our projectile has cross-sectional area A, perpendicular to the direction of
its motion, and is moving with speed v, then the mass of air that it knocks out of its way
per second is $\rho_a\,A\,v$, where $\rho_a$ is the mass density of air. Suppose that the air knocked out
of the way is pushed in the direction of the projectile's motion with a speed of order v. It
follows that the air gains momentum per unit time $\rho_a\,A\,v^2$ in the direction of the projectile's
motion. Hence, by Newton's third law, the projectile loses the same momentum per unit
time in the direction of its motion. In other words, the projectile is subject to a drag force
of magnitude

$$f_{\rm drag} = C\,\rho_a\,A\,v^2, \tag{3.26}$$

acting in the opposite direction to its motion. Here, C is an $\mathcal{O}(1)$ dimensionless constant,
known as the drag coefficient, which depends on the exact shape of the projectile.
Obviously, streamlined projectiles, such as arrows, have small drag coefficients, whereas
non-streamlined projectiles, such as bricks, have large drag coefficients. From before, the
terminal velocity of our projectile is $v_t = (m\,g/c)^{1/2}$, where m is its mass, and $c = C\,\rho_a\,A$.
Writing $m = A\,d\,\rho$, where d is the typical linear dimension of the projectile, and $\rho$ its mass
density, we obtain

$$v_t = \left(\frac{\rho\,g\,d}{C\,\rho_a}\right)^{1/2}. \tag{3.27}$$

The above expression tells us that large, dense, streamlined projectiles (e.g., medicine
balls) tend to have large terminal velocities, and small, rarefied, non-streamlined projectiles
(e.g., feathers) tend to have small terminal velocities. Hence, the former type of
projectile is relatively less affected by air resistance than the latter.



3.4 Simple Harmonic Motion 

Consider the motion of a point particle of mass m which is slightly displaced from a stable
equilibrium point at x = 0. Suppose that the particle is moving in the conservative force-field
f(x). According to the analysis of Section 3.2, in order for x = 0 to be a stable
equilibrium point we require both

$$f(0) = 0, \tag{3.28}$$






and

$$\frac{df(0)}{dx} < 0. \tag{3.29}$$

Now, our particle obeys Newton's second law of motion,

$$m\,\frac{d^2 x}{dt^2} = f(x). \tag{3.30}$$

Let us assume that it always stays fairly close to its equilibrium point. In this case, to a
good approximation, we can represent f(x) via a truncated Taylor expansion about this
point. In other words,

$$f(x) \simeq f(0) + \frac{df(0)}{dx}\,x + \mathcal{O}(x^2). \tag{3.31}$$

However, according to (3.28) and (3.29), the above expression can be written

$$f(x) \simeq -m\,\omega_0^2\,x, \tag{3.32}$$

where $df(0)/dx = -m\,\omega_0^2$. Hence, we conclude that our particle satisfies the following
approximate equation of motion,

$$\frac{d^2 x}{dt^2} + \omega_0^2\,x \simeq 0, \tag{3.33}$$

provided that it does not stray too far from its equilibrium point: i.e., provided |x| does not
become too large.

Equation (3.33) is called the simple harmonic equation, and governs the motion of
all one-dimensional conservative systems which are slightly perturbed from some stable
equilibrium state. The solution of Equation (3.33) is well-known:

$$x(t) = a \sin(\omega_0\,t - \phi_0). \tag{3.34}$$

The pattern of motion described by the above expression, which is called simple harmonic
motion, is periodic in time, with repetition period $T = 2\pi/\omega_0$, and oscillates between
$x = \pm a$. Here, a is called the amplitude of the motion. The parameter $\phi_0$, known as the phase
angle, simply shifts the pattern of motion backward and forward in time. Figure 3.3 shows
some examples of simple harmonic motion. Here, $\phi_0 = 0$, $+\pi/4$, and $-\pi/4$ correspond to
the solid, short-dashed, and long-dashed curves, respectively.

Note that the frequency, $\omega_0$ (and, hence, the period, T) of simple harmonic motion is
determined by the parameters appearing in the simple harmonic equation, (3.33). However,
the amplitude, a, and the phase angle, $\phi_0$, are the two integration constants of this
second-order ordinary differential equation, and are, thus, determined by the initial conditions:
i.e., by the particle's initial displacement and velocity.
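
To make the role of the two integration constants concrete, the following sketch recovers the amplitude and phase angle of Equation (3.34) from an initial displacement and velocity. The function name shm_constants and the sample numbers are assumptions introduced for this illustration. Setting t = 0 in (3.34) and in its time derivative gives $x(0) = -a \sin\phi_0$ and $dx(0)/dt = a\,\omega_0 \cos\phi_0$, which is what the code inverts.

```python
import math


def shm_constants(x_init, v_init, omega0):
    """Return (a, phi0) such that x(t) = a sin(omega0 t - phi0) satisfies
    x(0) = x_init and dx/dt(0) = v_init.
    """
    a = math.hypot(x_init, v_init / omega0)
    # sin(phi0) = -x_init / a and cos(phi0) = v_init / (a omega0)
    phi0 = math.atan2(-x_init, v_init / omega0)
    return a, phi0


# Sample initial conditions (assumed values).
omega0 = 2.0
a, phi0 = shm_constants(0.3, -0.8, omega0)
print(a, phi0)

# Reconstruct the initial conditions from (a, phi0) as a consistency check.
x_check = a * math.sin(-phi0)
v_check = a * omega0 * math.cos(-phi0)
print(x_check, v_check)
```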

Now, from Equations (3.1) and (3.32), the potential energy of our particle at position
x is approximately

$$U(x) \simeq \frac{1}{2}\,m\,\omega_0^2\,x^2. \tag{3.35}$$

Figure 3.3: Simple harmonic motion.

Hence, the total energy is written

$$E = K + U = \frac{1}{2}\,m \left(\frac{dx}{dt}\right)^2 + \frac{1}{2}\,m\,\omega_0^2\,x^2, \tag{3.36}$$

giving

$$E = \frac{1}{2}\,m\,\omega_0^2\,a^2 \cos^2(\omega_0\,t - \phi_0) + \frac{1}{2}\,m\,\omega_0^2\,a^2 \sin^2(\omega_0\,t - \phi_0) = \frac{1}{2}\,m\,\omega_0^2\,a^2, \tag{3.37}$$

where use has been made of Equation (3.34), and the trigonometric identity
$\cos^2\theta + \sin^2\theta = 1$. Note that the total energy is constant in time, as is to be expected for a
conservative system, and is proportional to the amplitude squared of the motion.

3.5 Damped Oscillatory Motion 

According to Equation (3.34), a one-dimensional conservative system which is slightly 
perturbed from a stable equilibrium point (and then left alone) oscillates about this point 
with a fixed frequency and a constant amplitude. In other words, the oscillations never 
die away. This is not very realistic, since we know that, in practice, if we slightly perturb a 
dynamical system (such as a pendulum) from a stable equilibrium point then it will indeed 
oscillate about this point, but these oscillations will eventually die away due to frictional 
effects, which are present in virtually all real dynamical systems. In order to model this 
process, we need to include some sort of frictional drag force in our perturbed equation of 
motion, (3.33). 

The most common model for a frictional drag force is one which is always directed in
the opposite direction to the instantaneous velocity of the object upon which it acts, and
is directly proportional to the magnitude of this velocity. Let us adopt this model. So, our
drag force can be written

$$f_{\rm drag} = -2\,m\,\nu\,\frac{dx}{dt}, \tag{3.38}$$

where $\nu$ is a positive constant with the dimensions of frequency. Including such a force in
our perturbed equation of motion, (3.33), we obtain

$$\frac{d^2 x}{dt^2} + 2\,\nu\,\frac{dx}{dt} + \omega_0^2\,x = 0. \tag{3.39}$$

Thus, the positive constant $\nu$ parameterizes the strength of the frictional damping in our
dynamical system.

Equation (3.39) is a linear second-order ordinary differential equation, which we suspect
possesses oscillatory solutions. There is a standard trick for solving such an equation.
We search for complex solutions of the form

$$x = a\,e^{-i\,\omega\,t}, \tag{3.40}$$

where the constants $\omega$ and a are both, in general, complex. Of course, the physical solution
is the real part of the above expression: i.e.,

$$x = |a| \cos[\arg(a) - {\rm Re}(\omega)\,t]\,e^{{\rm Im}(\omega)\,t}. \tag{3.41}$$

Clearly, the modulus and argument of the complex amplitude, a, determine the amplitude
(at t = 0) and phase of the oscillation, respectively, whereas the real and imaginary parts of
the complex frequency, $\omega$, determine its frequency and growth-rate, respectively. Note that
this method of solution is only appropriate for linear differential equations. Incidentally,
the method works because

$${\rm Re}[\mathcal{L}(x)] = \mathcal{L}({\rm Re}[x]), \tag{3.42}$$

where x is a complex variable, and $\mathcal{L}$ some real linear differential operator which acts on
this variable. [A linear operator satisfies $\mathcal{L}(a\,x) = a\,\mathcal{L}(x)$ for all a and x, where a is a
constant. The differential operator appearing in Equation (3.39) is clearly of this type.]
Substituting Equation (3.40) into Equation (3.39), we obtain

$$a \left[-\omega^2 - i\,2\,\nu\,\omega + \omega_0^2\right] e^{-i\,\omega\,t} = 0, \tag{3.43}$$

which reduces to the following quadratic equation for $\omega$ [since $a \exp(-i\,\omega\,t) \neq 0$]:

$$\omega^2 + i\,2\,\nu\,\omega - \omega_0^2 = 0. \tag{3.44}$$

The solution to this equation is

$$\omega_\pm = -i\,\nu \pm \sqrt{\omega_0^2 - \nu^2}. \tag{3.45}$$

Thus, the most general physical solution to Equation (3.39) is

$$x(t) = {\rm Re}\left[a_+\,e^{-i\,\omega_+ t} + a_-\,e^{-i\,\omega_- t}\right], \tag{3.46}$$

where $a_\pm$ are two arbitrary complex constants.

Figure 3.4: Damped oscillatory motion.
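
The two roots (3.45) are easy to check with complex arithmetic. The sketch below, in which the function name complex_frequencies and the sample damping values are assumptions, evaluates both roots and substitutes them back into the quadratic (3.44). It also confirms that the imaginary part of each root is negative, so that, according to (3.41), the corresponding solution decays in time.

```python
import cmath


def complex_frequencies(omega0, nu):
    """Return the two roots of omega^2 + 2 i nu omega - omega0^2 = 0."""
    root = cmath.sqrt(omega0**2 - nu**2)  # purely imaginary when nu > omega0
    return -1j * nu + root, -1j * nu - root


omega0 = 1.0
residuals = []
growth_rates = []
for nu in (0.25, 1.0, 4.0):  # sample damping values (assumptions)
    for w in complex_frequencies(omega0, nu):
        residuals.append(abs(w**2 + 2j * nu * w - omega0**2))
        growth_rates.append(w.imag)  # Im(omega) is the growth rate in (3.41)
        print(nu, w)
```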

We can distinguish three different cases. In the first case, $\nu < \omega_0$, and the motion is
said to be underdamped. The most general solution is written

$$x(t) = x_0\,e^{-\nu t} \cos(\omega_r\,t) + \left(\frac{v_0 + \nu\,x_0}{\omega_r}\right) e^{-\nu t} \sin(\omega_r\,t), \tag{3.47}$$

where $\omega_r = \sqrt{\omega_0^2 - \nu^2}$, $x_0 = x(0)$, and $v_0 = dx(0)/dt$. It can be seen that the solution
oscillates at some real frequency, $\omega_r$, which is somewhat less than the natural frequency
of oscillation of the undamped system, $\omega_0$, but also decays exponentially in time at a rate
proportional to the damping coefficient, $\nu$.

In the second case, $\nu = \omega_0$, and the motion is said to be critically damped. The most
general solution is written

$$x(t) = [x_0\,(1 + \omega_0\,t) + v_0\,t]\,e^{-\omega_0 t}. \tag{3.48}$$

It can be seen that the solution now decays without oscillating.

In the third case, $\nu > \omega_0$, and the motion is said to be overdamped. The most general
solution is written

$$x(t) = -\left(\frac{v_0 + \nu_-\,x_0}{\nu_+ - \nu_-}\right) e^{-\nu_+ t} + \left(\frac{v_0 + \nu_+\,x_0}{\nu_+ - \nu_-}\right) e^{-\nu_- t}, \tag{3.49}$$

where $\nu_\pm = \nu \pm \sqrt{\nu^2 - \omega_0^2}$. It can be seen that the solution again decays without oscillating,
except there are now two independent decay rates. The largest, $\nu_+$, is always greater than
the critically damped decay rate, $\omega_0$, whereas the smaller, $\nu_-$, is always less than this decay
rate. This means that, in general, the critically damped solution is more rapidly damped
than either the underdamped or overdamped solutions.

Figure 3.4 shows typical examples of underdamped (i.e., $\nu = \omega_0/4$), critically damped
(i.e., $\nu = \omega_0$), and overdamped (i.e., $\nu = 4\,\omega_0$) solutions, calculated with the initial
conditions $x_0 = 1$ and $v_0 = 0$. Here, $T_0 = 2\pi/\omega_0$. The three solutions correspond to the
solid, short-dashed, and long-dashed curves, respectively.
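
The three cases can be gathered into a single routine, which is one way curves like those of Figure 3.4 might be generated. This is a sketch under stated assumptions: the function name damped_x is hypothetical, and the exact floating-point comparison nu == omega0 used to select the critically damped branch is a simplification that only makes sense when the two values are set to be identical.

```python
import math


def damped_x(t, x0, v0, omega0, nu):
    """Displacement of the damped oscillator, Equations (3.47) to (3.49)."""
    if nu < omega0:  # underdamped, Equation (3.47)
        w_r = math.sqrt(omega0**2 - nu**2)
        return math.exp(-nu * t) * (
            x0 * math.cos(w_r * t) + (v0 + nu * x0) / w_r * math.sin(w_r * t)
        )
    if nu == omega0:  # critically damped, Equation (3.48)
        return (x0 * (1.0 + omega0 * t) + v0 * t) * math.exp(-omega0 * t)
    # overdamped, Equation (3.49)
    s = math.sqrt(nu**2 - omega0**2)
    nu_plus, nu_minus = nu + s, nu - s
    return (
        -(v0 + nu_minus * x0) * math.exp(-nu_plus * t)
        + (v0 + nu_plus * x0) * math.exp(-nu_minus * t)
    ) / (nu_plus - nu_minus)


# Same damping values and initial conditions as quoted for Figure 3.4.
omega0, x0, v0 = 1.0, 1.0, 0.0
t = 2.0 * (2.0 * math.pi / omega0)  # two undamped periods
under = damped_x(t, x0, v0, omega0, 0.25)
critical = damped_x(t, x0, v0, omega0, 1.0)
over = damped_x(t, x0, v0, omega0, 4.0)
print(under, critical, over)
```

With these values the critically damped displacement after two undamped periods should be the smallest of the three in magnitude, in line with the remark above.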



3.6 Quality Factor 

The total energy of a damped oscillator is the sum of its kinetic and potential energies: i.e.,

$$E = \frac{1}{2}\,m \left(\frac{dx}{dt}\right)^2 + \frac{1}{2}\,m\,\omega_0^2\,x^2. \tag{3.50}$$

Differentiating the above expression with respect to time, we obtain

$$\frac{dE}{dt} = m\,\frac{dx}{dt}\,\frac{d^2 x}{dt^2} + m\,\omega_0^2\,x\,\frac{dx}{dt} = m\,\frac{dx}{dt} \left(\frac{d^2 x}{dt^2} + \omega_0^2\,x\right). \tag{3.51}$$

It follows from Equation (3.39) that

$$\frac{dE}{dt} = -2\,m\,\nu \left(\frac{dx}{dt}\right)^2. \tag{3.52}$$

We conclude that the presence of damping causes the oscillator energy to decrease monotonically
in time, and, hence, causes the amplitude of the oscillation to eventually become
negligibly small [see Equation (3.37)].

The energy loss rate of a weakly damped (i.e., $\nu \ll \omega_0$) oscillator is conveniently characterized
in terms of a parameter, Q, which is known as the quality factor. This parameter
is defined to be $2\pi$ times the energy stored in the oscillator, divided by the energy lost in a
single oscillation period. If the oscillator is weakly damped then the energy lost per period
is relatively small, and Q is therefore much larger than unity. Roughly speaking, Q is the
number of oscillations that the oscillator typically completes, after being set in motion,
before its amplitude decays to a negligible value. Let us find an expression for Q.
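
The definition of Q just given can also be evaluated numerically, which provides a rough check on the analytic estimate $\omega_0/(2\,\nu)$ obtained at the end of this section. The sketch below assumes the weakly damped solution of Section 3.5 with zero phase angle; the function name oscillator_energy and the values m = 1, $\omega_0 = 1$, $\nu = 0.001$, and unit initial amplitude are illustrative assumptions.

```python
import math


def oscillator_energy(t, m, omega0, nu, amplitude):
    """Kinetic plus potential energy, Equation (3.50), for the weakly damped
    motion x = amplitude * exp(-nu t) * cos(omega_r t)."""
    w_r = math.sqrt(omega0**2 - nu**2)
    envelope = amplitude * math.exp(-nu * t)
    x = envelope * math.cos(w_r * t)
    v = -envelope * (nu * math.cos(w_r * t) + w_r * math.sin(w_r * t))
    return 0.5 * m * v * v + 0.5 * m * omega0**2 * x * x


# Illustrative values (assumptions): very weak damping, nu << omega0.
m, omega0, nu, amplitude = 1.0, 1.0, 0.001, 1.0
period = 2.0 * math.pi / math.sqrt(omega0**2 - nu**2)

E_start = oscillator_energy(0.0, m, omega0, nu, amplitude)
E_end = oscillator_energy(period, m, omega0, nu, amplitude)

Q_numeric = 2.0 * math.pi * E_start / (E_start - E_end)  # definition of Q
Q_estimate = omega0 / (2.0 * nu)                         # weak-damping estimate
print(Q_numeric, Q_estimate)
```

The two numbers agree only to leading order in $\nu/\omega_0$, which is the accuracy claimed for the weakly damped limit.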

Now, the most general solution for a weakly damped oscillator can be written in the
form [cf., Equation (3.47)]

$$x = x_0\,e^{-\nu t} \cos(\omega_r\,t - \phi_0), \tag{3.53}$$

where $x_0$ and $\phi_0$ are constants, and $\omega_r = \sqrt{\omega_0^2 - \nu^2}$. It follows that

$$\frac{dx}{dt} = -x_0\,\nu\,e^{-\nu t} \cos(\omega_r\,t - \phi_0) - x_0\,\omega_r\,e^{-\nu t} \sin(\omega_r\,t - \phi_0). \tag{3.54}$$

Thus, making use of Equation (3.52), the energy lost during a single oscillation period is

$$\Delta E = -\int_0^{T_r} \frac{dE}{dt}\,dt = 2\,m\,\nu\,x_0^2 \int_0^{T_r} e^{-2\nu t} \left[\nu \cos(\omega_r\,t - \phi_0) + \omega_r \sin(\omega_r\,t - \phi_0)\right]^2 dt, \tag{3.55}$$

where $T_r = 2\pi/\omega_r$. In the weakly damped limit, $\nu \ll \omega_r$, the exponential factor is approximately
unity in the interval t = 0 to $t = T_r$, so that

$$\Delta E \simeq \frac{2\,m\,\nu\,x_0^2}{\omega_r} \int_0^{2\pi} \left(\nu^2 \cos^2\theta + 2\,\nu\,\omega_r \cos\theta \sin\theta + \omega_r^2 \sin^2\theta\right) d\theta. \tag{3.56}$$

Thus,

$$\Delta E \simeq \frac{2\pi\,m\,\nu\,x_0^2}{\omega_r}\,(\nu^2 + \omega_r^2) = 2\pi\,m\,\omega_0^2\,x_0^2 \left(\frac{\nu}{\omega_r}\right), \tag{3.57}$$

since $\cos^2\theta$ and $\sin^2\theta$ both have the average values 1/2 in the interval 0 to $2\pi$, whereas
$\cos\theta \sin\theta$ has the average value 0. According to Equation (3.37), the energy stored in the
oscillator (at t = 0) is

$$E = \frac{1}{2}\,m\,\omega_0^2\,x_0^2. \tag{3.58}$$

It follows that

$$Q = 2\pi\,\frac{E}{\Delta E} = \frac{\omega_r}{2\,\nu} \simeq \frac{\omega_0}{2\,\nu}. \tag{3.59}$$



3.7 Resonance


We have seen that when a one-dimensional dynamical system is slightly perturbed from a
stable equilibrium point (and then left alone), it eventually returns to this point at a rate
controlled by the amount of damping in the system. Let us now suppose that the same
system is subject to continuous oscillatory constant amplitude external forcing at some
fixed frequency, $\omega$. In this case, we would expect the system to eventually settle down to
some steady oscillatory pattern of motion with the same frequency as the external forcing.
Let us investigate the properties of this type of driven oscillation.

Suppose that our system is subject to an external force of the form

$$f_{\rm ext}(t) = m\,\omega_0^2\,X_1 \cos(\omega\,t). \tag{3.60}$$

Here, $X_1$ is the amplitude of the oscillation at which the external force matches the restoring
force, (3.32). Incorporating the above force into our perturbed equation of motion,
(3.39), we obtain

$$\frac{d^2 x}{dt^2} + 2\,\nu\,\frac{dx}{dt} + \omega_0^2\,x = \omega_0^2\,X_1 \cos(\omega\,t). \tag{3.61}$$

Let us search for a solution of the form (3.40), and represent the right-hand side of the
above equation as $\omega_0^2\,X_1 \exp(-i\,\omega\,t)$. It is again understood that the physical solutions are
the real parts of these expressions. Note that $\omega$ is now a real parameter. We obtain

$$a \left[-\omega^2 - i\,2\,\nu\,\omega + \omega_0^2\right] e^{-i\,\omega\,t} = \omega_0^2\,X_1\,e^{-i\,\omega\,t}. \tag{3.62}$$

Hence,

$$a = \frac{\omega_0^2\,X_1}{\omega_0^2 - \omega^2 - i\,2\,\nu\,\omega}. \tag{3.63}$$

Figure 3.5: Resonance.

In general, a is a complex quantity. Thus, we can write

$$a = x_1\,e^{i\,\phi_1}, \tag{3.64}$$

where $x_1$ and $\phi_1$ are both real. It follows from Equations (3.40), (3.63), and (3.64) that
the physical solution takes the form

$$x(t) = x_1 \cos(\omega\,t - \phi_1), \tag{3.65}$$

where

$$x_1 = \frac{\omega_0^2\,X_1}{\left[(\omega_0^2 - \omega^2)^2 + 4\,\nu^2\,\omega^2\right]^{1/2}}, \tag{3.66}$$

and

$$\phi_1 = \tan^{-1}\!\left(\frac{2\,\nu\,\omega}{\omega_0^2 - \omega^2}\right). \tag{3.67}$$

We conclude that, in response to the applied sinusoidal force, (3.60), the system executes a
sinusoidal pattern of motion at the same frequency, with fixed amplitude $x_1$, and phase-lag
$\phi_1$ (with respect to the external force).
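
The amplitude and phase-lag of the driven response follow directly from the complex amplitude (3.63), and it can be instructive to evaluate them with complex arithmetic. In the sketch below the function name driven_response and the sample values $\omega_0 = 1$, $\nu = 0.05$ are assumptions. The phase returned by cmath.phase lies between 0 and $\pi$ for positive $\nu$ and $\omega$, which corresponds to choosing that branch of the inverse tangent in Equation (3.67).

```python
import cmath
import math


def driven_response(omega, omega0, nu, X1=1.0):
    """Return (x1, phi1): modulus and argument of the complex amplitude (3.63)."""
    a = omega0**2 * X1 / (omega0**2 - omega**2 - 2j * nu * omega)
    return abs(a), cmath.phase(a)


omega0, nu = 1.0, 0.05  # sample values (assumptions); weak damping
for omega in (0.1, 1.0, 3.0):
    x1, phi1 = driven_response(omega, omega0, nu)
    # Direct evaluation of Equation (3.66) for comparison.
    x1_formula = omega0**2 / math.sqrt(
        (omega0**2 - omega**2) ** 2 + 4.0 * nu**2 * omega**2
    )
    print(omega, x1, x1_formula, phi1)
```

Well below resonance the phase-lag is close to zero, at resonance it equals $\pi/2$, and well above resonance it approaches $\pi$.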

Let us investigate the variation of $x_1$ and $\phi_1$ with the forcing frequency, $\omega$. This is
most easily done graphically. Figure 3.5 shows $x_1$ and $\phi_1$ as functions of $\omega$ for various
values of $\nu/\omega_0$. Here, $\nu/\omega_0 = 1$, 1/2, 1/4, 1/8, and 1/16 correspond to the solid, dotted,
short-dashed, long-dashed, and dot-dashed curves, respectively. It can be seen that as the
amount of damping in the system is decreased, the amplitude of the response becomes progressively
more peaked at the natural frequency of oscillation of the system, $\omega_0$. This effect
is known as resonance, and $\omega_0$ is termed the resonant frequency. Thus, a weakly damped
system (i.e., $\nu \ll \omega_0$) can be driven to large amplitude by the application of a relatively
small external force which oscillates at a frequency close to the resonant frequency. Note
that the response of the system is in phase (i.e., $\phi_1 \simeq 0$) with the external driving force
for driving frequencies well below the resonant frequency, is in phase quadrature (i.e.,
$\phi_1 = \pi/2$) at the resonant frequency, and is in anti-phase (i.e., $\phi_1 \simeq \pi$) for frequencies
well above the resonant frequency.
According to Equation (3.66),

$$\frac{x_1(\omega = \omega_0)}{x_1(\omega = 0)} = \frac{\omega_0}{2\,\nu} = Q. \tag{3.68}$$

In other words, the ratio of the driven amplitude at the resonant frequency to that at
a typical non-resonant frequency (for the same drive amplitude) is of order the quality
factor. Equation (3.66) also implies that, for a weakly damped oscillator ($\nu \ll \omega_0$),

$$\frac{x_1(\omega)}{x_1(\omega = \omega_0)} \simeq \frac{\nu}{\left[(\omega - \omega_0)^2 + \nu^2\right]^{1/2}}, \tag{3.69}$$

provided $|\omega - \omega_0| \ll \omega_0$. Hence, the width of the resonance peak (in frequency) is
$\Delta\omega = 2\,\nu$, where the edges of the peak are defined as the points at which the driven amplitude is
reduced to $1/\sqrt{2}$ of its maximum value. It follows that the fractional width is

$$\frac{\Delta\omega}{\omega_0} = \frac{2\,\nu}{\omega_0} = \frac{1}{Q}. \tag{3.70}$$

We conclude that the height and width of the resonance peak of a weakly damped ($Q \gg 1$)
oscillator scale as Q and $Q^{-1}$, respectively. Thus, the area under the resonance curve stays
approximately constant as Q varies.

3.8 Periodic Driving Forces 

In the last section, we investigated the response of a one-dimensional dynamical system,
close to a stable equilibrium point, to an external force which varies as $\cos(\omega\,t)$. Let us
now examine the response of the same system to a more complicated external force.

Consider a general external force which is periodic in time, with period T. By analogy
with Equation (3.60), we can write such a force as

$$f_{\rm ext}(t) = m\,\omega_0^2\,X(t), \tag{3.71}$$

where

$$X(t + T) = X(t) \tag{3.72}$$

for all t.

It is convenient to represent X(t) as a Fourier series in time, so that

$$X(t) = \sum_{n=0}^{\infty} X_n \cos(n\,\omega\,t), \tag{3.73}$$

where $\omega = 2\pi/T$. By writing X(t) in this form, we automatically satisfy the periodicity
constraint (3.72). [Note that by choosing a cosine Fourier series we are limited to even
functions in t: i.e., X(-t) = X(t). Odd functions in t can be represented by sine Fourier
series, and mixed functions require a combination of cosine and sine Fourier series.] The
constant coefficients $X_n$ are known as Fourier coefficients. But, how do we determine these
coefficients for a given functional form, X(t)?

Well, it follows from the periodicity of the cosine function that

$$\frac{1}{T} \int_0^T \cos(n\,\omega\,t)\,dt = \delta_{n0}, \tag{3.74}$$

where $\delta_{nn'}$ is unity if n = n', and zero otherwise, and is known as the Kronecker delta
function. Thus, integrating Equation (3.73) in t from t = 0 to t = T, and making use of
Equation (3.74), we obtain

$$X_0 = \frac{1}{T} \int_0^T X(t)\,dt. \tag{3.75}$$

It is also easily demonstrated that

$$\frac{2}{T} \int_0^T \cos(n\,\omega\,t) \cos(n'\,\omega\,t)\,dt = \delta_{nn'}, \tag{3.76}$$

provided n, n' > 0. Thus, multiplying Equation (3.73) by $\cos(n\,\omega\,t)$, integrating in t from
t = 0 to t = T, and making use of Equations (3.74) and (3.76), we obtain

$$X_n = \frac{2}{T} \int_0^T X(t) \cos(n\,\omega\,t)\,dt \tag{3.77}$$

for n > 0. Hence, we have now determined the Fourier coefficients of the general periodic
function X(t).
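
Equations (3.75) and (3.77) can be evaluated numerically for any given X(t). The following sketch uses a midpoint rule over one period; the function name cosine_coefficients, the number of samples, and the test function $X(t) = 3 + 2 \cos(\omega t) - 0.5 \cos(3\,\omega t)$ are assumptions introduced purely for illustration.

```python
import math


def cosine_coefficients(X, T, n_max, samples=2000):
    """Estimate the Fourier coefficients X_0 ... X_{n_max} of an even periodic
    function X(t) of period T, using Equations (3.75) and (3.77)."""
    omega = 2.0 * math.pi / T
    h = T / samples
    coefficients = []
    for n in range(n_max + 1):
        total = 0.0
        for i in range(samples):
            t = (i + 0.5) * h  # midpoint of the i-th sub-interval
            total += X(t) * math.cos(n * omega * t) * h
        prefactor = 1.0 / T if n == 0 else 2.0 / T
        coefficients.append(prefactor * total)
    return coefficients


# Test function with known coefficients (an assumption for this sketch).
T = 2.0
omega = 2.0 * math.pi / T


def X(t):
    return 3.0 + 2.0 * math.cos(omega * t) - 0.5 * math.cos(3.0 * omega * t)


coefficients = cosine_coefficients(X, T, 4)
print(coefficients)
```

For this test function the recovered coefficients should be close to 3, 2, 0, -0.5, and 0.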

We can incorporate the periodic external force (3.71) into our perturbed equation of
motion by writing

$$\frac{d^2 x}{dt^2} + 2\,\nu\,\frac{dx}{dt} + \omega_0^2\,x = \omega_0^2 \sum_{n=0}^{\infty} X_n\,e^{-i\,n\,\omega\,t}, \tag{3.78}$$

where we are again using the convention that the physical solution corresponds to the
real part of the complex solution. Note that the above differential equation is linear. This
means that if $x_a(t)$ and $x_b(t)$ are solutions corresponding to two different driving terms then
$x_a(t) + x_b(t)$ is a solution corresponding to the sum of those driving terms. We can exploit the linearity of
the above equation to write the solution in the form

$$x(t) = \sum_{n=0}^{\infty} X_n\,a_n\,e^{-i\,n\,\omega\,t}, \tag{3.79}$$

where the $a_n$ are the complex amplitudes of the solutions to

$$\frac{d^2 x}{dt^2} + 2\,\nu\,\frac{dx}{dt} + \omega_0^2\,x = \omega_0^2\,e^{-i\,n\,\omega\,t}. \tag{3.80}$$

In other words, $a_n$ is obtained by substituting $x = a_n \exp(-i\,n\,\omega\,t)$ into the above equation.
Hence, it follows that

$$a_n = \frac{\omega_0^2}{\omega_0^2 - n^2\,\omega^2 - i\,2\,\nu\,n\,\omega}. \tag{3.81}$$

Thus, the physical solution takes the form

$$x(t) = \sum_{n=0}^{\infty} X_n\,x_n \cos(n\,\omega\,t - \phi_n), \tag{3.82}$$

where

$$a_n = x_n\,e^{i\,\phi_n}, \tag{3.83}$$

and $x_n$ and $\phi_n$ are real parameters. It follows from Equation (3.81) that

$$x_n = \frac{\omega_0^2}{\left[(\omega_0^2 - n^2\,\omega^2)^2 + 4\,\nu^2\,n^2\,\omega^2\right]^{1/2}}, \tag{3.84}$$

and

$$\phi_n = \tan^{-1}\!\left(\frac{2\,\nu\,n\,\omega}{\omega_0^2 - n^2\,\omega^2}\right). \tag{3.85}$$

We have now fully determined the response of our dynamical system to a general periodic
driving force.

As an example, suppose that the external force periodically delivers a brief kick to the
system. For instance, let X(t) = A for 0 < t < T/10 and 9T/10 < t < T, and X(t) = 0
otherwise (in the period 0 < t < T). It follows from Equations (3.75) and (3.77) that, in
this case,

$$X_0 = 0.2\,A, \tag{3.86}$$

and

$$X_n = \frac{2 \sin(n\,\pi/5)\,A}{n\,\pi}, \tag{3.87}$$

for n > 0. Obviously, to obtain an exact solution, we would have to include every Fourier
harmonic in Equation (3.82), which would be impractical. However, we can obtain a fairly
accurate approximate solution by truncating the Fourier series (i.e., by neglecting all the
terms with n > N, where $N \gg 1$).

Figure 3.6 shows an example calculation in which the Fourier series is truncated after 
100 terms. The parameters used in this calculation are $\omega = 1.2\,\omega_0$ and $\nu = 0.8\,\omega_0$. The
left panel shows the Fourier reconstruction of the driving force, X(t). The glitches at the 
rising and falling edges of the pulses are called Gibbs phenomena, and are an inevitable 
consequence of attempting to represent a discontinuous periodic function as a Fourier 
series. The right panel shows the Fourier reconstruction of the response, x(t), of the 
dynamical system to the applied force. 
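The truncated reconstruction described above can be sketched numerically. The following Python fragment is only an illustration, not the calculation used to produce Figure 3.6: it evaluates the coefficients (3.86)-(3.87) and the factors (3.84)-(3.85), then sums the series for $X(t)$ and $x(t)$. The function names, the choice of units ($\omega_0 = 1$, $A = 1$), and the convention that $N = 100$ means "keep harmonics up to $n = 100$" are all assumptions made for the example.

```python
import math

def kick_coefficients(A, N):
    """Cosine-series coefficients X_0..X_N of the periodic kick, Eqs. (3.86)-(3.87)."""
    coeffs = [0.2 * A]
    for n in range(1, N + 1):
        coeffs.append(2.0 * A * math.sin(n * math.pi / 5.0) / (n * math.pi))
    return coeffs

def response_factor(n, omega, omega0, nu):
    """Amplitude x_n and phase phi_n of the n-th harmonic, Eqs. (3.84)-(3.85)."""
    detuning = omega0**2 - (n * omega)**2
    damping = 2.0 * nu * n * omega
    x_n = omega0**2 / math.hypot(detuning, damping)
    # atan2 keeps phi_n in the range 0..pi, so the phase stays continuous
    # when n*omega passes through omega0 (a bare arctangent would jump).
    phi_n = math.atan2(damping, detuning)
    return x_n, phi_n

def driving_force(t, coeffs, omega):
    """Truncated Fourier reconstruction of the driving term X(t)."""
    return sum(X_n * math.cos(n * omega * t) for n, X_n in enumerate(coeffs))

def response(t, coeffs, omega, omega0, nu):
    """Truncated Fourier reconstruction of the response x(t), Eq. (3.82)."""
    total = 0.0
    for n, X_n in enumerate(coeffs):
        x_n, phi_n = response_factor(n, omega, omega0, nu)
        total += X_n * x_n * math.cos(n * omega * t - phi_n)
    return total

# Parameters quoted for Figure 3.6, in assumed units where omega0 = 1 and A = 1.
omega0 = 1.0
omega = 1.2 * omega0
nu = 0.8 * omega0
T = 2.0 * math.pi / omega
coeffs = kick_coefficients(A=1.0, N=100)
for fraction in (0.0, 0.25, 0.5, 0.75):
    t = fraction * T
    print(f"t/T = {fraction:4.2f}  X = {driving_force(t, coeffs, omega):+.3f}  "
          f"x = {response(t, coeffs, omega, omega0, nu):+.3f}")
```

Because the high harmonics are multiplied by $x_n \sim \omega_0^2/(n^2\,\omega^2)$, the series for $x(t)$ converges much faster than the series for $X(t)$, which is why the response shows no visible Gibbs glitches.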
3.9 Transients

We saw, in Section 3.7, that when a one-dimensional dynamical system, close to a stable 
equilibrium point, is subject to a sinusoidal external force of the form (3.60) then the 
equation of motion of the system is written 

$$\frac{d^2 x}{dt^2} + 2\,\nu\,\frac{dx}{dt} + \omega_0^2\,x = \omega_0^2\,X_1 \cos(\omega\,t). \tag{3.88}$$

We also found that the solution to this equation which oscillates in sympathy with the
applied force takes the form

$$x(t) = x_1 \cos(\omega\,t - \phi_1), \tag{3.89}$$

where $x_1$ and $\phi_1$ are specified in Equations (3.66) and (3.67), respectively. However,
(3.89) is not the most general solution to Equation (3.88). It should be clear that we can
take the above solution and add to it any solution of Equation (3.88) calculated with the
right-hand side set to zero, and the result will also be a solution of Equation (3.88). Now,
we investigated the solutions to (3.88) with the right-hand side set to zero in Section 3.5. In
the underdamped regime ($\nu < \omega_0$), we found that the most general such solution takes
the form

$$x(t) = A\,e^{-\nu\,t}\cos(\omega_r\,t) + B\,e^{-\nu\,t}\sin(\omega_r\,t), \tag{3.90}$$

where $A$ and $B$ are two arbitrary constants [they are in fact the integration constants of
the second-order ordinary differential equation (3.88)], and $\omega_r = \sqrt{\omega_0^2 - \nu^2}$. Thus, the
most general solution to Equation (3.88) is written

$$x(t) = A\,e^{-\nu\,t}\cos(\omega_r\,t) + B\,e^{-\nu\,t}\sin(\omega_r\,t) + x_1 \cos(\omega\,t - \phi_1). \tag{3.91}$$

The first two terms on the right-hand side of the above equation are called transients, since
they decay in time. The transients are determined by the initial conditions. However, if
we wait long enough after setting the system into motion then the transients will always
decay away, leaving the time-asymptotic solution (3.89), which is independent of the initial
conditions.

Figure 3.7: Transients.

As an example, suppose that we set the system into motion at time $t = 0$ with the initial
conditions $x(0) = dx(0)/dt = 0$. Setting $x(0) = 0$ in Equation (3.91), we obtain

$$A = -x_1 \cos\phi_1. \tag{3.92}$$

Moreover, setting $dx(0)/dt = 0$ in Equation (3.91), we get

$$B = -\frac{x_1\,(\nu\cos\phi_1 + \omega\sin\phi_1)}{\omega_r}. \tag{3.93}$$

Thus, we have now determined the constants $A$ and $B$, and, hence, fully specified the
solution for $t > 0$. Figure 3.7 shows this solution (solid curve) calculated for $\omega = 2\,\omega_0$
and $\nu = 0.2\,\omega_0$. Here, $T_0 = 2\pi/\omega_0$. The associated time-asymptotic solution (3.89) is also
shown for the sake of comparison (dashed curve). It can be seen that the full solution 
quickly converges to the time-asymptotic solution. 
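As a rough numerical check of this example, the sketch below evaluates the full solution (3.91) with the constants fixed by $x(0) = dx(0)/dt = 0$, and compares it with the time-asymptotic solution (3.89). Equations (3.66) and (3.67) are not reproduced in this section, so the code assumes that $x_1$ and $\phi_1$ are the $n = 1$ case of the amplitude and phase formulae of Section 3.8, with the amplitude scaled by $X_1$; the function names and the units ($\omega_0 = 1$, $X_1 = 1$) are likewise illustrative choices.

```python
import math

def steady_state(omega, omega0, nu, X1=1.0):
    """Amplitude x_1 and phase phi_1 of the time-asymptotic solution.

    Assumed form: the n = 1 harmonic response, scaled by the drive amplitude X1.
    """
    detuning = omega0**2 - omega**2
    damping = 2.0 * nu * omega
    x1 = omega0**2 * X1 / math.hypot(detuning, damping)
    phi1 = math.atan2(damping, detuning)
    return x1, phi1

def asymptotic_solution(t, omega, omega0, nu, X1=1.0):
    """Time-asymptotic solution x_1 cos(omega t - phi_1)."""
    x1, phi1 = steady_state(omega, omega0, nu, X1)
    return x1 * math.cos(omega * t - phi1)

def full_solution(t, omega, omega0, nu, X1=1.0):
    """General underdamped solution with x(0) = dx/dt(0) = 0 imposed."""
    x1, phi1 = steady_state(omega, omega0, nu, X1)
    omega_r = math.sqrt(omega0**2 - nu**2)   # requires nu < omega0
    A = -x1 * math.cos(phi1)
    B = -x1 * (nu * math.cos(phi1) + omega * math.sin(phi1)) / omega_r
    transient = math.exp(-nu * t) * (A * math.cos(omega_r * t) + B * math.sin(omega_r * t))
    return transient + asymptotic_solution(t, omega, omega0, nu, X1)

# Parameters quoted for Figure 3.7, in assumed units where omega0 = 1.
omega0, omega, nu = 1.0, 2.0, 0.2
T0 = 2.0 * math.pi / omega0
for periods in (0, 1, 2, 4, 8):
    t = periods * T0
    gap = full_solution(t, omega, omega0, nu) - asymptotic_solution(t, omega, omega0, nu)
    print(f"t = {periods} T0: full minus asymptotic = {gap:+.2e}")
```

The printed difference is just the transient part of the solution, so it shrinks like $e^{-\nu\,t}$.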



3.10 Simple Pendulum 

Consider a compact mass $m$ suspended from a light inextensible string of length $l$, such
that the mass is free to swing from side to side in a vertical plane, as shown in Figure 3.8.
This setup is known as a simple pendulum. Let $\theta$ be the angle subtended between the string
and the downward vertical. Obviously, the stable equilibrium state of the simple pendulum
corresponds to the situation in which the mass is stationary, and hangs vertically down (i.e.,
$\theta = 0$). From elementary mechanics, the angular equation of motion of the pendulum is

$$I\,\frac{d^2\theta}{dt^2} = \tau, \tag{3.94}$$

where $I$ is the moment of inertia of the mass (see Section 8.3), and $\tau$ is the torque acting
about the pivot point (see Section A.7). For the case in hand, given that the mass is
essentially a point particle, and is situated a distance $l$ from the axis of rotation (i.e., the
pivot point), it is easily seen that $I = m\,l^2$.

Figure 3.8: A simple pendulum.

The two forces acting on the mass are the downward gravitational force, $m\,g$, where $g$
is the acceleration due to gravity, and the tension, $T$, in the string. Note, however, that the
tension makes no contribution to the torque, since its line of action clearly passes through
the pivot point. From simple trigonometry, the line of action of the gravitational force
passes a distance $l \sin\theta$ from the pivot point. Hence, the magnitude of the gravitational
torque is $m\,g\,l \sin\theta$. Moreover, the gravitational torque is a restoring torque: i.e., if the
mass is displaced slightly from its equilibrium state (i.e., $\theta = 0$) then the gravitational
torque clearly acts to push the mass back toward that state. Thus, we can write

$$\tau = -m\,g\,l \sin\theta. \tag{3.95}$$

Combining the previous two equations, we obtain the following angular equation of motion
of the pendulum:

$$l\,\frac{d^2\theta}{dt^2} + g \sin\theta = 0. \tag{3.96}$$

Note that, unlike all of the other equations of motion which we have examined in this
chapter, the above equation is nonlinear [since $\sin(a\,\theta) \neq a \sin\theta$, except when $\theta \ll 1$].

Let us assume, as usual, that the system does not stray very far from its equilibrium
state ($\theta = 0$). If this is the case then we can make the small angle approximation $\sin\theta \simeq \theta$,
and the above equation of motion simplifies to

$$\frac{d^2\theta}{dt^2} + \omega_0^2\,\theta \simeq 0, \tag{3.97}$$

where $\omega_0 = \sqrt{g/l}$. Of course, this is just the simple harmonic equation. Hence, we can
immediately write the solution as

$$\theta(t) = \theta_0 \cos(\omega_0\,t). \tag{3.98}$$

We conclude that the pendulum swings back and forth at a fixed frequency, $\omega_0$, which
depends on $l$ and $g$, but is independent of the amplitude, $\theta_0$, of the motion.

Suppose, now, that we desire a more accurate solution of Equation (3.96). One way in
which we could achieve this would be to include more terms in the small angle expansion
of $\sin\theta$, which is

$$\sin\theta = \theta - \frac{\theta^3}{3!} + \frac{\theta^5}{5!} + \cdots. \tag{3.99}$$

For instance, keeping the first two terms in this expansion, Equation (3.96) becomes

$$\frac{d^2\theta}{dt^2} + \omega_0^2\,(\theta - \theta^3/6) \simeq 0. \tag{3.100}$$

By analogy with (3.98), let us try a trial solution of the form

$$\theta(t) = \vartheta_0 \cos(\omega\,t). \tag{3.101}$$

Substituting this into Equation (3.100), and making use of the trigonometric identity

$$\cos^3 u \equiv (3/4)\cos u + (1/4)\cos(3u), \tag{3.102}$$

we obtain

$$\vartheta_0 \left[\omega_0^2 - \omega^2 - (1/8)\,\omega_0^2\,\vartheta_0^2\right] \cos(\omega\,t) - (1/24)\,\omega_0^2\,\vartheta_0^3 \cos(3\,\omega\,t) \simeq 0. \tag{3.103}$$

It is evident that the above equation cannot be satisfied for all values of $t$, except in the
trivial case $\vartheta_0 = 0$. However, the form of this expression does suggest a better trial solution,
namely

$$\theta(t) = \vartheta_0 \cos(\omega\,t) + \alpha\,\vartheta_0^3 \cos(3\,\omega\,t), \tag{3.104}$$

where $\alpha$ is $\mathcal{O}(1)$. Substitution of this expression into Equation (3.100) yields



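$$\vartheta_0 \left[\omega_0^2 - \omega^2 - (1/8)\,\omega_0^2\,\vartheta_0^2\right] \cos(\omega\,t) + \vartheta_0^3 \left[\alpha\,\omega_0^2 - 9\,\alpha\,\omega^2 - (1/24)\,\omega_0^2\right] \cos(3\,\omega\,t) + \mathcal{O}(\vartheta_0^5) \simeq 0. \tag{3.105}$$

We can only satisfy the above equation at all values of $t$, and for non-zero $\vartheta_0$, by setting
the two expressions in square brackets to zero. This yields

$$\omega \simeq \omega_0 \sqrt{1 - (1/8)\,\vartheta_0^2}, \tag{3.106}$$

and

$$\alpha \simeq -\frac{1}{192}. \tag{3.107}$$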
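Now, the amplitude of the motion is given by

$$\theta_0 = \vartheta_0 + \alpha\,\vartheta_0^3 = \vartheta_0 - \frac{\vartheta_0^3}{192}. \tag{3.108}$$

Hence, Equation (3.106) simplifies to

$$\omega = \omega_0 \left[1 - \frac{\theta_0^2}{16} + \mathcal{O}(\theta_0^4)\right]. \tag{3.109}$$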



The above expression is only approximate, but it illustrates an important point: i.e., that 
the frequency of oscillation of a simple pendulum is not, in fact, amplitude independent. 
Indeed, the frequency goes down slightly as the amplitude increases. 

The above example illustrates how we might go about solving a nonlinear equation of 
motion by means of an expansion in a small parameter (in this case, the amplitude of the 
motion). 
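The size of the error in the approximate frequency can be gauged against the exact result for the undamped pendulum. The sketch below relies on a standard result that is not derived in this chapter and is quoted here as an assumption: the exact period is $T = 2\pi/[\omega_0\,M(1, \cos(\theta_0/2))]$, where $M$ denotes the arithmetic-geometric mean, so that $\omega/\omega_0 = M(1, \cos(\theta_0/2))$. The function names are illustrative.

```python
import math

def agm(a, b, iterations=30):
    """Arithmetic-geometric mean of a and b (converges quadratically)."""
    for _ in range(iterations):
        a, b = 0.5 * (a + b), math.sqrt(a * b)
    return a

def frequency_ratio_exact(theta0):
    """omega/omega_0 from the exact period (assumed AGM formula, 0 <= theta0 < pi)."""
    return agm(1.0, math.cos(0.5 * theta0))

def frequency_ratio_approx(theta0):
    """omega/omega_0 from the small-amplitude expansion, 1 - theta0^2/16."""
    return 1.0 - theta0**2 / 16.0

for degrees in (5, 20, 45, 90):
    theta0 = math.radians(degrees)
    exact = frequency_ratio_exact(theta0)
    approx = frequency_ratio_approx(theta0)
    print(f"theta0 = {degrees:2d} deg  exact = {exact:.6f}  "
          f"approx = {approx:.6f}  difference = {exact - approx:+.2e}")
```

Both ratios are below unity, in line with the statement that the frequency goes down as the amplitude increases, and the difference between them grows roughly like $\theta_0^4$, as expected of an expansion truncated at second order in the amplitude.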

3.11 Exercises 

3.1. If a train of mass $M$ is subject to a retarding force $M\,(a + b\,v^2)$, show that if the engines are
shut off when the speed is $v_0$ then the train will come to rest in a time

$$\frac{1}{\sqrt{a\,b}}\,\tan^{-1}\!\left(\sqrt{\frac{b}{a}}\,v_0\right),$$

after traveling a distance

$$\frac{1}{2\,b}\,\ln\!\left(1 + \frac{b\,v_0^2}{a}\right).$$

3.2. A particle is projected vertically upward from the Earth's surface with a velocity which would,
if gravity were uniform, carry it to a height $h$. Show that if the variation of gravity with height
is allowed for, but the resistance of air is neglected, then the height reached will be greater
by $h^2/(R - h)$, where $R$ is the Earth's radius.

3.3. A particle is projected vertically upward from the Earth's surface with a velocity just sufficient
for it to reach infinity (neglecting air resistance). Prove that the time needed to reach a height
$h$ is

$$\frac{1}{3}\left(\frac{2\,R}{g}\right)^{1/2}\left[\left(1 + \frac{h}{R}\right)^{3/2} - 1\right],$$

where $R$ is the Earth's radius, and $g$ its surface gravitational acceleration.

3.4. A particle of mass $m$ is constrained to move in one dimension such that its instantaneous
displacement is $x$. The particle is released at rest from $x = b$, and is subject to a force of the
form $f(x) = -k\,x^{-2}$. Show that the time required for the particle to reach the origin is

$$\pi \left(\frac{m\,b^3}{8\,k}\right)^{1/2}.$$
3.5. A block of mass $m$ slides along a horizontal surface which is lubricated with heavy oil such
that the block suffers a viscous retarding force of the form

$$F = -c\,v^n,$$

where $c > 0$ is a constant, and $v$ is the block's instantaneous velocity. If the initial speed is $v_0$
at time $t = 0$, find $v$ and the displacement $x$ as functions of time $t$. Also find $v$ as a function
of $x$. Show that for $n = 1/2$ the block does not travel further than $2\,m\,v_0^{3/2}/(3\,c)$.

3.6. A particle is projected vertically upward in a constant gravitational field with an initial speed
$v_0$. Show that if there is a retarding force proportional to the square of the speed then the
speed of the particle when it returns to the initial position is

$$\frac{v_0\,v_t}{\sqrt{v_0^2 + v_t^2}},$$

where $v_t$ is the terminal speed.

3.7. A particle of mass $m$ moves (in one dimension) in a medium under the influence of a retarding
force of the form $m\,k\,(v^3 + a^2\,v)$, where $v$ is the particle speed, and $k$ and $a$ are positive
constants. Show that for any value of the initial speed the particle will never move a distance
greater than $\pi/(2\,k\,a)$, and will only come to rest as $t \rightarrow \infty$.

3.8. Two light springs have spring constants $k_1$ and $k_2$, respectively, and are used in a vertical
orientation to support an object of mass $m$. Show that the angular frequency of oscillation is
$[(k_1 + k_2)/m]^{1/2}$ if the springs are in parallel, and $[k_1\,k_2/((k_1 + k_2)\,m)]^{1/2}$ if the springs are in
series.

3.9. A body of uniform cross-sectional area $A$ and mass density $\rho$ floats in a liquid of density $\rho_0$
(where $\rho < \rho_0$), and at equilibrium displaces a volume $V$. Show that the period of small
oscillations about the equilibrium position is

$$2\pi \sqrt{\frac{V}{g\,A}}.$$

3.10. Show that the ratio of two successive maxima in the displacement of a damped harmonic
oscillator is constant.

3.11. If the amplitude of a damped harmonic oscillator decreases to $1/e$ of its initial value after $n$
periods, show that the ratio of the period of oscillation to the period of the same oscillator
with no damping is

$$\left(1 + \frac{1}{4\pi^2\,n^2}\right)^{1/2}.$$



3.12. Consider a damped driven oscillator whose equation of motion is

$$\frac{d^2 x}{dt^2} + 2\,\nu\,\frac{dx}{dt} + \omega_0^2\,x = F(t).$$

Let $x = 0$ and $dx/dt = v_0$ at $t = 0$.

(a) Find the solution for $t > 0$ when $F = \sin(\omega\,t)$.

(b) Find the solution for $t > 0$ when $F = \sin^2(\omega\,t)$.

3.13. Obtain the time asymptotic response of a damped linear oscillator of natural frequency $\omega_0$
and damping coefficient $\nu$ to a square-wave periodic forcing function of amplitude $F_0 =
m\,\omega_0^2\,X_0$ and frequency $\omega$. Thus, $F(t) = F_0$ for $-\pi/2 < \omega\,t < \pi/2$, $3\pi/2 < \omega\,t < 5\pi/2$, etc.,
and $F(t) = -F_0$ for $\pi/2 < \omega\,t < 3\pi/2$, $5\pi/2 < \omega\,t < 7\pi/2$, etc.
4 Multi-Dimensional Motion



4.1 Introduction

This chapter employs Newton's laws to investigate various aspects of multi-dimensional 
motion. 

4.2 Motion in a Two-Dimensional Harmonic Potential 

Consider a particle of mass m moving in the two-dimensional harmonic potential 

$$U(x, y) = \frac{1}{2}\,k\,r^2, \tag{4.1}$$

where $r = \sqrt{x^2 + y^2}$, and $k > 0$. It follows that the particle is subject to a force,

$$\mathbf{f} = -\nabla U = -k\,(x,\,y) = -k\,\mathbf{r}, \tag{4.2}$$

which always points towards the origin, and whose magnitude increases linearly with
increasing distance from the origin. According to Newton's second law, the equation of
motion of the particle is

$$m\,\frac{d^2\mathbf{r}}{dt^2} = \mathbf{f} = -k\,\mathbf{r}. \tag{4.3}$$

When written in component form, the above equation reduces to

$$\frac{d^2 x}{dt^2} = -\omega_0^2\,x, \tag{4.4}$$

$$\frac{d^2 y}{dt^2} = -\omega_0^2\,y, \tag{4.5}$$

where $\omega_0 = \sqrt{k/m}$.

Since Equations (4.4) and (4.5) are both simple harmonic equations, we can immediately
write their general solutions:

$$x = A \cos(\omega_0\,t - \phi_1), \tag{4.6}$$

$$y = B \cos(\omega_0\,t - \phi_2). \tag{4.7}$$

Here, $A$, $B$, $\phi_1$, and $\phi_2$ are arbitrary constants of integration. We can simplify the above
equations slightly by shifting the origin of time (which is, after all, arbitrary): i.e.,

$$t \rightarrow t' + \phi_1/\omega_0. \tag{4.8}$$
Hence, we obtain

$$x = A \cos(\omega_0\,t'), \tag{4.9}$$

$$y = B \cos(\omega_0\,t' - \Delta), \tag{4.10}$$

where $\Delta = \phi_2 - \phi_1$. Note that the motion is clearly periodic in time, with period $T = 2\pi/\omega_0$.
Thus, the particle must trace out some closed trajectory in the $x$-$y$ plane. The question, now,
is what does this trajectory look like as a function of the relative phase-shift, $\Delta$, between
the oscillations in the $x$- and $y$-directions?

Using standard trigonometry, we can write Equation (4.10) in the form

$$y = B \left[\cos(\omega_0\,t') \cos\Delta + \sin(\omega_0\,t') \sin\Delta\right]. \tag{4.11}$$

Hence, using Equation (4.9), we obtain

$$\left(\frac{y}{B} - \frac{x}{A} \cos\Delta\right)^2 = \sin^2(\omega_0\,t') \sin^2\Delta = \left(1 - \frac{x^2}{A^2}\right) \sin^2\Delta, \tag{4.12}$$

which simplifies to give

$$\frac{x^2}{A^2} - 2\,\frac{x\,y}{A\,B} \cos\Delta + \frac{y^2}{B^2} = \sin^2\Delta. \tag{4.13}$$



Unfortunately, the above equation is not immediately recognizable as being the equation 
of any particular geometric curve: e.g., a circle, an ellipse, or a parabola, etc. 

Perhaps our problem is that we are using the wrong coordinates. Suppose that we 
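rotate our coordinate axes about the $z$-axis by an angle $\theta$, as illustrated in Figure A.5.
According to Equations (A.17) and (A.18), our old coordinates $(x, y)$ are related to our
new coordinates $(x', y')$ via

$$x = x' \cos\theta - y' \sin\theta, \tag{4.14}$$

$$y = x' \sin\theta + y' \cos\theta. \tag{4.15}$$

Let us see whether Equation (4.13) takes a simpler form when expressed in terms of our
new coordinates. Equations (4.13)-(4.15) yield

$$x'^2 \left[\frac{\cos^2\theta}{A^2} - \frac{2 \cos\theta \sin\theta \cos\Delta}{A\,B} + \frac{\sin^2\theta}{B^2}\right]
+ y'^2 \left[\frac{\sin^2\theta}{A^2} + \frac{2 \cos\theta \sin\theta \cos\Delta}{A\,B} + \frac{\cos^2\theta}{B^2}\right]
+ x'\,y' \left[-\frac{2 \sin\theta \cos\theta}{A^2} + \frac{2\,(\sin^2\theta - \cos^2\theta) \cos\Delta}{A\,B} + \frac{2 \cos\theta \sin\theta}{B^2}\right] = \sin^2\Delta. \tag{4.16}$$

We can simplify the above equation by setting the term involving $x'\,y'$ to zero. Hence,

$$-\frac{\sin(2\,\theta)}{A^2} - \frac{2 \cos(2\,\theta) \cos\Delta}{A\,B} + \frac{\sin(2\,\theta)}{B^2} = 0, \tag{4.17}$$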
where we have made use of some simple trigonometric identities. Thus, the $x'\,y'$ term
disappears when $\theta$ takes the special value

$$\theta = \frac{1}{2} \tan^{-1}\!\left(\frac{2\,A\,B \cos\Delta}{A^2 - B^2}\right). \tag{4.18}$$

In this case, Equation (4.16) reduces to

$$\frac{x'^2}{a^2} + \frac{y'^2}{b^2} = 1, \tag{4.19}$$

where

$$\frac{1}{a^2} = \frac{1}{\sin^2\Delta} \left[\frac{\cos^2\theta}{A^2} - \frac{2 \cos\theta \sin\theta \cos\Delta}{A\,B} + \frac{\sin^2\theta}{B^2}\right], \tag{4.20}$$

$$\frac{1}{b^2} = \frac{1}{\sin^2\Delta} \left[\frac{\sin^2\theta}{A^2} + \frac{2 \cos\theta \sin\theta \cos\Delta}{A\,B} + \frac{\cos^2\theta}{B^2}\right]. \tag{4.21}$$

Of course, we immediately recognize Equation (4.19) as the equation of an ellipse, centered
on the origin, whose major and minor axes are aligned along the $x'$- and $y'$-axes, and
whose major and minor radii are $a$ and $b$, respectively (assuming that $a > b$).

We conclude that, in general, a particle of mass $m$ moving in the two-dimensional
harmonic potential (4.1) executes a closed elliptical orbit (which is not necessarily aligned
along the $x$- and $y$-axes), centered on the origin, with period $T = 2\pi/\omega_0$, where $\omega_0 =
\sqrt{k/m}$.

Figure 4.1 shows some example trajectories calculated for $A = 2$, $B = 1$, and the
following values of the phase difference, $\Delta$: (a) $\Delta = 0$; (b) $\Delta = \pi/4$; (c) $\Delta = \pi/2$; (d)
$\Delta = 3\pi/4$. Note that when $\Delta = 0$ the trajectory degenerates into a straight-line (which can
be thought of as an ellipse whose minor radius is zero).

Perhaps, the main lesson to be learned from the above study of two-dimensional motion 
in a harmonic potential is that comparatively simple patterns of motion can be made to 
look complicated when expressed in terms of ill-chosen coordinates. 
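The rotation argument can be checked numerically. The sketch below, which is an illustration rather than part of the original derivation, computes $\theta$, $a$, and $b$ from Equations (4.18), (4.20), and (4.21), and then verifies that points on the trajectory (4.9)-(4.10) satisfy the ellipse equation (4.19) in the rotated coordinates. The function names are hypothetical. One deliberate deviation: `atan2` is used in place of a bare arctangent so that the case $A = B$ does not divide by zero; this can shift $\theta$ by $\pi/2$ relative to (4.18), which still removes the $x'y'$ term and merely exchanges the roles of $a$ and $b$.

```python
import math

def ellipse_parameters(A, B, delta):
    """Rotation angle theta and radii a, b of the orbit; requires sin(delta) != 0."""
    theta = 0.5 * math.atan2(2.0 * A * B * math.cos(delta), A**2 - B**2)
    c, s = math.cos(theta), math.sin(theta)
    cross_term = 2.0 * c * s * math.cos(delta) / (A * B)
    sin2_delta = math.sin(delta)**2
    inv_a2 = (c**2 / A**2 - cross_term + s**2 / B**2) / sin2_delta   # Eq. (4.20)
    inv_b2 = (s**2 / A**2 + cross_term + c**2 / B**2) / sin2_delta   # Eq. (4.21)
    return theta, 1.0 / math.sqrt(inv_a2), 1.0 / math.sqrt(inv_b2)

def to_rotated(x, y, theta):
    """Invert Eqs. (4.14)-(4.15): express (x, y) in the rotated (x', y') frame."""
    c, s = math.cos(theta), math.sin(theta)
    return x * c + y * s, -x * s + y * c

# Case (b) of Figure 4.1: A = 2, B = 1, Delta = pi/4.
A, B, delta = 2.0, 1.0, math.pi / 4.0
theta, a, b = ellipse_parameters(A, B, delta)
print(f"theta = {math.degrees(theta):.3f} deg, a = {a:.4f}, b = {b:.4f}")
for k in range(4):
    phase = 0.5 * math.pi * k            # omega_0 t'
    x = A * math.cos(phase)
    y = B * math.cos(phase - delta)
    xp, yp = to_rotated(x, y, theta)
    print(f"omega0 t' = {phase:.3f}: x'^2/a^2 + y'^2/b^2 = {xp**2 / a**2 + yp**2 / b**2:.12f}")
```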



4.3 Projectile Motion with Air Resistance 

Suppose that a projectile of mass $m$ is launched, at $t = 0$, from ground level (in a flat
plain), making an angle $\theta$ to the horizontal. Suppose, further, that, in addition to the force
of gravity, the projectile is subject to an air resistance force which acts in the opposite
direction to its instantaneous direction of motion, and whose magnitude is directly proportional
to its instantaneous speed. This is not a particularly accurate model of the drag force due
to air resistance (the magnitude of the drag force is typically proportional to the square of
the speed; see Section 3.3), but it does lead to tractable equations of motion. Hence, by
using this model we can, at least, get some idea of how air resistance modifies projectile
trajectories.

Figure 4.1: Trajectories in a two-dimensional harmonic oscillator potential.



Let us adopt a Cartesian coordinate system whose origin coincides with the launch 
point, and whose z-axis points vertically upward. Let the initial velocity of the projectile 
lie in the x-z plane. Note that, since neither gravity nor the drag force cause the projectile 
to move out of the x-z plane, we can effectively ignore the y coordinate in this problem. 

The equation of motion of our projectile is written

$$m\,\frac{d\mathbf{v}}{dt} = m\,\mathbf{g} - c\,\mathbf{v}, \tag{4.22}$$

where $\mathbf{v} = (v_x, v_z)$ is the projectile velocity, $\mathbf{g} = (0, -g)$ the acceleration due to gravity, and
$c$ a positive constant. In component form, the above equation becomes

$$\frac{dv_x}{dt} = -g\,\frac{v_x}{v_t}, \tag{4.23}$$

$$\frac{dv_z}{dt} = -g \left(1 + \frac{v_z}{v_t}\right). \tag{4.24}$$

Here, $v_t = m\,g/c$ is the terminal velocity: i.e., the velocity at which the drag force balances
the gravitational force (for a projectile falling vertically downward).
Integrating Equation (4.23), we obtain

$$\int_{v_{x0}}^{v_x} \frac{dv_x}{v_x} = -\frac{g}{v_t}\,t, \tag{4.25}$$

where $v_{x0} = v_0 \cos\theta$ is the $x$-component of the launch velocity. Hence,

$$\ln\!\left(\frac{v_x}{v_{x0}}\right) = -\frac{g}{v_t}\,t, \tag{4.26}$$

or

$$v_x = v_0 \cos\theta\; e^{-g\,t/v_t}. \tag{4.27}$$

It is clear, from the above equation, that air drag causes the projectile's horizontal velocity,
which would otherwise be constant, to decay exponentially on a time-scale of order $v_t/g$.
Integrating Equation (4.24), we get

$$\int_{v_{z0}}^{v_z} \frac{dv_z}{v_t + v_z} = -\frac{g}{v_t}\,t, \tag{4.28}$$

where $v_{z0} = v_0 \sin\theta$ is the $z$-component of the launch velocity. Hence,

$$\ln\!\left(\frac{v_t + v_z}{v_t + v_{z0}}\right) = -\frac{g}{v_t}\,t, \tag{4.29}$$

or

$$v_z = v_0 \sin\theta\; e^{-g\,t/v_t} - v_t \left(1 - e^{-g\,t/v_t}\right). \tag{4.30}$$

It thus follows, from Equations (4.27) and (4.30), that if the projectile stays in the air
much longer than a time of order $v_t/g$ then it ends up falling vertically downward at the
terminal velocity, $v_t$, irrespective of its initial launch angle.
Integration of (4.27) yields

$$x = \frac{v_0\,v_t \cos\theta}{g} \left(1 - e^{-g\,t/v_t}\right). \tag{4.31}$$

In the limit $t \ll v_t/g$, the above equation reduces to

$$x = v_0 \cos\theta\; t, \tag{4.32}$$

which is the standard result in the absence of air drag. In the opposite limit, $t \gg v_t/g$, we
get

$$x = \frac{v_0\,v_t \cos\theta}{g}. \tag{4.33}$$

The above expression clearly sets an effective upper limit on how far the projectile can
travel in the horizontal direction.
Integration of (4.30) gives

$$z = \frac{v_t}{g}\,(v_0 \sin\theta + v_t) \left(1 - e^{-g\,t/v_t}\right) - v_t\,t. \tag{4.34}$$

In the limit $t \ll v_t/g$, this equation reduces to

$$z = v_0 \sin\theta\; t - \frac{g}{2}\,t^2, \tag{4.35}$$

which is the standard result in the absence of air drag. In the opposite limit, $t \gg v_t/g$, we
get

$$z = \frac{v_t}{g}\,(v_0 \sin\theta + v_t) - v_t\,t. \tag{4.36}$$

Incidentally, the above analysis implies that air resistance only starts to have an appreciable
effect on the trajectory after the projectile has been in the air a time of order $v_t/g$.

It is clear, from Equations (4.35) and (4.36), that the time of flight of the projectile (i.e.,
the time at which $z = 0$, excluding the trivial result $t = 0$) is

$$t_f = \frac{2\,v_0 \sin\theta}{g} \tag{4.37}$$

when $t_f \ll v_t/g$, which implies that $v_0 \sin\theta \ll v_t$, and

$$t_f = \frac{v_0 \sin\theta}{g} \tag{4.38}$$

when $t_f \gg v_t/g$, which implies that $v_0 \sin\theta \gg v_t$ (i.e., the vertical component of the launch
velocity is much greater than the terminal velocity). It thus follows, from Equations (4.32)
and (4.33), that the horizontal range [i.e., $x(t_f)$] of the projectile is

$$R = \frac{v_0^2 \sin(2\,\theta)}{g} \tag{4.39}$$

when $v_0 \sin\theta \ll v_t$, and

$$R = \frac{v_0\,v_t \cos\theta}{g} \tag{4.40}$$

when $v_0 \sin\theta \gg v_t$. Equation (4.39) is, of course, the standard result without air resistance.
This result implies that, in the absence of air resistance, the maximum horizontal
range, $v_0^2/g$, is achieved when the launch angle $\theta$ takes the value 45°. On the other hand,
Equation (4.40) implies that, in the presence of air resistance, the maximum horizontal
range, $v_0\,v_t/g$, is achieved when $\theta$ is made as small as possible. However, $\theta$ cannot be
made too small, since expression (4.40) is only valid when $v_0 \sin\theta \gg v_t$. In fact, assuming
that $v_0 \gg v_t$, the maximum horizontal range, $v_0\,v_t/g$, is achieved when $\theta \sim v_t/v_0 \ll 1$.
We thus conclude that if air resistance is significant then it causes the horizontal range
of the projectile to scale linearly, rather than quadratically, with the launch velocity, $v_0$.
Moreover, the maximum horizontal range is achieved with a launch angle which is much 
shallower than the standard result, 45°. 
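The two limiting expressions for the range can be compared with the range obtained directly from the closed-form trajectory. The sketch below is illustrative only: it evaluates $x(t)$ and $z(t)$ for the linear-drag model, finds the time of flight by bisection on $z(t) = 0$, and reports $x(t_f)$. The function names, the value $g = 9.81$, and the sample speeds are assumptions, and the launch angle is taken to satisfy $0 < \theta < \pi/2$.

```python
import math

def position(t, v0, theta, vt, g=9.81):
    """Closed-form (x, z) of the projectile under linear drag."""
    decay = -math.expm1(-g * t / vt)      # 1 - exp(-g t / v_t), accurate for small t
    x = v0 * vt * math.cos(theta) / g * decay
    z = vt / g * (v0 * math.sin(theta) + vt) * decay - vt * t
    return x, z

def time_of_flight(v0, theta, vt, g=9.81):
    """Positive root of z(t) = 0, located by bisection."""
    hi = (v0 * math.sin(theta) + vt) / g  # z(hi) <= 0, since 1 - exp(...) < 1
    lo = 0.5 * hi
    while position(lo, v0, theta, vt, g)[1] <= 0.0:
        lo *= 0.5                         # shrink until the projectile is still airborne
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if position(mid, v0, theta, vt, g)[1] > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def horizontal_range(v0, theta, vt, g=9.81):
    """Horizontal distance travelled when the projectile returns to z = 0."""
    return position(time_of_flight(v0, theta, vt, g), v0, theta, vt, g)[0]

g, theta = 9.81, math.radians(45.0)
for v0, vt in ((10.0, 1.0e4), (10.0, 10.0), (1000.0, 1.0)):
    R = horizontal_range(v0, theta, vt, g)
    weak = v0**2 * math.sin(2.0 * theta) / g      # no-drag limit
    strong = v0 * vt * math.cos(theta) / g        # strong-drag limit
    print(f"v0/vt = {v0 / vt:8.3g}: R = {R:9.4f}, "
          f"no-drag limit = {weak:9.4f}, strong-drag limit = {strong:9.4f}")
```

Bisection is used here because $z(t)$ is concave with $z(0) = 0$ and a positive initial slope, so it has exactly one positive root; any bracketing root finder would serve equally well.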

Figure 4.2 shows some example trajectories calculated, from the above model, with the
same launch angle, 45°, but with different values of the ratio $v_0/v_t$. Here, $X = x/(v_0^2/g)$ and
$Z = z/(v_0^2/g)$. The solid, short-dashed, long-dashed, and dot-dashed curves correspond to
$v_0/v_t = 0$, 1, 2, and 4, respectively. It can be seen that as the air resistance strength
increases (i.e., as $v_0/v_t$ increases), the range of the projectile decreases. Furthermore, there
is always an initial time interval during which the trajectory is identical to that calculated
in the absence of air resistance (i.e., $v_0/v_t = 0$). Finally, in the presence of air resistance,
the projectile tends to fall more steeply than it rises. Indeed, in the presence of strong air
resistance (i.e., $v_0/v_t = 4$), the projectile falls almost vertically.

Figure 4.2: Projectile trajectories in the presence of air resistance.

4.4 Charged Particle Motion in Electric and Magnetic Fields 

Consider a particle of mass $m$ and electric charge $q$ moving in the uniform electric and
magnetic fields, $\mathbf{E}$ and $\mathbf{B}$. Suppose that the fields are "crossed" (i.e., perpendicular to one
another), so that $\mathbf{E} \cdot \mathbf{B} = 0$.

The force acting on the particle is given by the familiar Lorentz law:

$$\mathbf{f} = q\,(\mathbf{E} + \mathbf{v} \times \mathbf{B}), \tag{4.41}$$

where $\mathbf{v}$ is the particle's instantaneous velocity. Hence, from Newton's second law, the
particle's equation of motion can be written

$$m\,\frac{d\mathbf{v}}{dt} = q\,(\mathbf{E} + \mathbf{v} \times \mathbf{B}). \tag{4.42}$$

It turns out that we can eliminate the electric field from the above equation by transforming
to a different inertial frame. Thus, writing

$$\mathbf{v} = \frac{\mathbf{E} \times \mathbf{B}}{B^2} + \mathbf{v}', \tag{4.43}$$

Equation (4.42) reduces to

$$m\,\frac{d\mathbf{v}'}{dt} = q\,\mathbf{v}' \times \mathbf{B}, \tag{4.44}$$

where we have made use of a standard vector identity (see Section A.10), as well as the
fact that $\mathbf{E} \cdot \mathbf{B} = 0$. Hence, we conclude that the addition of an electric field perpendicular to
a given magnetic field simply causes the particle to drift perpendicular to both the electric
and magnetic fields with the fixed velocity

$$\mathbf{v}_{EB} = \frac{\mathbf{E} \times \mathbf{B}}{B^2}, \tag{4.45}$$

irrespective of its charge or mass. It follows that the electric field has no effect on the
particle's motion in a frame of reference which is co-moving with the so-called E-cross-B
velocity given above.

Let us suppose that the magnetic field is directed along the $z$-axis. As we have just seen,
in the $\mathbf{E} \times \mathbf{B}$ frame, the particle's equation of motion reduces to Equation (4.44), which can
be written:

$$\frac{dv_x'}{dt} = \Omega\,v_y', \tag{4.46}$$

$$\frac{dv_y'}{dt} = -\Omega\,v_x', \tag{4.47}$$

$$\frac{dv_z'}{dt} = 0. \tag{4.48}$$

Here,

$$\Omega = \frac{q\,B}{m} \tag{4.49}$$

is the so-called cyclotron frequency. Equations (4.46)-(4.48) can be integrated to give

$$v_x' = v_\perp \sin(\Omega\,t), \tag{4.50}$$

$$v_y' = v_\perp \cos(\Omega\,t), \tag{4.51}$$

$$v_z' = v_\parallel, \tag{4.52}$$

where we have judiciously chosen the origin of time so as to eliminate any phase offset
in the arguments of the above trigonometrical functions. According to Equations (4.50)-
(4.52), in the $\mathbf{E} \times \mathbf{B}$ frame, our charged particle gyrates at the cyclotron frequency in the
plane perpendicular to the magnetic field with some fixed speed $v_\perp$, and drifts parallel
to the magnetic field with some fixed speed $v_\parallel$. The fact that the cyclotron frequency
is positive for positively charged particles, and negative for negatively charged particles,
just means that oppositely charged particles gyrate in opposite directions in the plane
perpendicular to the magnetic field.

Equations (4.50)-(4.52) can be integrated to give

$$x' = -\rho \cos(\Omega\,t), \tag{4.53}$$

$$y' = \rho \sin(\Omega\,t), \tag{4.54}$$

$$z' = v_\parallel\,t, \tag{4.55}$$

where we have judiciously chosen the origin of our coordinate system so as to eliminate
any constant offsets in the above equations. Here,

$$\rho = \frac{v_\perp}{\Omega} \tag{4.56}$$

is called the Larmor radius. Equations (4.53)-(4.55) are the equations of a spiral of radius
$\rho$, aligned along the direction of the magnetic field (i.e., the $z$-direction).

Figure 4.3: The spiral trajectory of a negatively charged particle in a magnetic field.

We conclude that the general motion of a charged particle in crossed electric and magnetic
fields is a combination of $\mathbf{E} \times \mathbf{B}$ drift [see Equation (4.45)] and spiral motion aligned
along the direction of the magnetic field (see Figure 4.3). Particles drift parallel to the
magnetic field with constant speeds, and gyrate at the cyclotron frequency in the plane 
perpendicular to the magnetic field with constant speeds. Oppositely charged particles 
gyrate in opposite directions. 
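The decomposition into drift plus gyration can be verified directly. The sketch below builds the laboratory-frame velocity as the sum of the E-cross-B velocity (4.45) and the gyration (4.50)-(4.52), with the magnetic field along $z$ and the electric field in the $x$-$y$ plane, and checks by finite differences that it satisfies the equation of motion (4.42). All function names and numerical values are illustrative assumptions; nothing here is specific to a real particle.

```python
import math

def cross(a, b):
    """Vector product of two 3-component tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def drift_velocity(E, B):
    """E-cross-B drift velocity, E x B / B^2."""
    B2 = sum(c * c for c in B)
    return tuple(c / B2 for c in cross(E, B))

def lab_velocity(t, q, m, E, B_z, v_perp, v_par):
    """Drift plus gyration for B = (0, 0, B_z) and E perpendicular to B."""
    Omega = q * B_z / m                           # cyclotron frequency
    drift = drift_velocity(E, (0.0, 0.0, B_z))
    gyration = (v_perp * math.sin(Omega * t), v_perp * math.cos(Omega * t), v_par)
    return tuple(d + g for d, g in zip(drift, gyration))

def lorentz_acceleration(v, q, m, E, B_z):
    """Right-hand side of the equation of motion divided by m."""
    v_cross_B = cross(v, (0.0, 0.0, B_z))
    return tuple(q * (E_i + vB_i) / m for E_i, vB_i in zip(E, v_cross_B))

# Hypothetical negatively charged particle in arbitrary units.
q, m, B_z = -1.0, 0.5, 2.0
E = (0.3, -0.7, 0.0)                              # perpendicular to B
v_perp, v_par, h = 1.5, 0.4, 1.0e-6
for t in (0.0, 0.3, 1.7):
    ahead = lab_velocity(t + h, q, m, E, B_z, v_perp, v_par)
    behind = lab_velocity(t - h, q, m, E, B_z, v_perp, v_par)
    dv_dt = tuple((p - n) / (2.0 * h) for p, n in zip(ahead, behind))
    rhs = lorentz_acceleration(lab_velocity(t, q, m, E, B_z, v_perp, v_par), q, m, E, B_z)
    error = max(abs(lhs_i - rhs_i) for lhs_i, rhs_i in zip(dv_dt, rhs))
    print(f"t = {t:3.1f}: max |dv/dt - (q/m)(E + v x B)| = {error:.1e}")
```

The check works because $(\mathbf{E} \times \mathbf{B}) \times \mathbf{B}/B^2 = -\mathbf{E}$ when $\mathbf{E} \cdot \mathbf{B} = 0$, so the drift term cancels the electric field exactly, independently of $q$ and $m$.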

4.5 Exercises 

4.1. An electron of mass $m$ and charge $-e$ moves in a uniform $y$-directed electric field of magnitude
$E$, and a uniform $z$-directed magnetic field of magnitude $B$. The electron is situated at
the origin at $t = 0$ with an initial $x$-directed velocity of magnitude $v_0$. Show that the electron
traces out a cycloid of the general form

$$x = a \sin(\omega\,t) + b\,t,$$
$$y = c\,[1 - \cos(\omega\,t)],$$
$$z = 0.$$

Find the values of $a$, $b$, $c$, and $\omega$, and sketch the electron's trajectory in the $x$-$y$ plane when
$v_0 < E/B$, $E/B < v_0 < 2\,E/B$, and $v_0 > 2\,E/B$.

4.2. A particle of mass $m$ and charge $q$ moves in the $x$-$y$ plane under the influence of a constant
amplitude rotating electric field which is such that $E_x = E_0 \cos(\omega\,t)$ and $E_y = E_0 \sin(\omega\,t)$.
The particle starts at rest from the origin. Determine its subsequent motion. What shape is
the particle's trajectory?

4.3. A particle of mass $m$ slides on a frictionless surface whose height is a function of $x$ only: i.e.,
$z = z(x)$. The function $z(x)$ is specified by the parametric equations

$$x = A\,[2\,\phi + \sin(2\,\phi)],$$
$$z = A\,[1 - \cos(2\,\phi)],$$

where $\phi$ is the parameter. Show that the total energy of the particle can be written

$$E = \frac{m}{2} \left(\frac{ds}{dt}\right)^2 + \frac{1}{2}\,\frac{m\,g}{4\,A}\,s^2,$$

where $s = 4\,A \sin\phi$. Deduce that the particle undergoes periodic motion whose frequency is
amplitude independent (even when the amplitude is large). Demonstrate that the period
of the motion is given by $4\pi\,(A/g)^{1/2}$.
5 Planetary Motion



5.1 Introduction 

Newtonian dynamics was initially developed in order to account for the motion of the 
Planets around the Sun. Let us now investigate this problem. 

5.2 Kepler's Laws 

As is well-known, Johannes Kepler was the first astronomer to correctly describe planetary 
motion in the Solar System (in works published between 1609 and 1619). The motion of 
the Planets is summed up in three simple laws: 

1. The planetary orbits are all ellipses which are confocal with the Sun (i.e., the Sun
lies at one of the foci of each ellipse).

2. The radius vectors connecting each planet to the Sun sweep out equal areas in equal 
time intervals. 

3. The squares of the orbital periods of the planets are proportional to the cubes of their 
orbital major radii. 

Let us now see if we can derive Kepler's laws from Newton's laws of motion. 

5.3 Newtonian Gravity 

The force which maintains the Planets in orbit around the Sun is called gravity, and was 
first correctly described by Isaac Newton (in 1687). According to Newton, any two point 
mass objects (or spherically symmetric objects of finite extent) exert a force of attraction 
on one another. This force points along the line of centers joining the objects, is directly 
proportional to the product of the objects' masses, and inversely proportional to the square 
of the distance between them. Suppose that the first object is the Sun, which is of mass 
M, and is located at the origin of our coordinate system. Let the second object be some 
planet, of mass m, which is located at position vector r. The gravitational force exerted on 
the planet by the Sun is thus written

$$\mathbf{f} = -\frac{G\,M\,m}{r^3}\,\mathbf{r}. \tag{5.1}$$

The constant of proportionality, $G$, is called the gravitational constant, and takes the value

$$G = 6.67300 \times 10^{-11}\ {\rm m^3\,kg^{-1}\,s^{-2}}. \tag{5.2}$$
An equal and opposite force to (5.1) acts on the Sun. However, we shall assume that the
Sun is so much more massive than the planet in question that this force does not cause 
the Sun's position to shift appreciably. Hence, the Sun will always remain at the origin of 
our coordinate system. Likewise, we shall neglect the gravitational forces exerted on our 
planet by the other planets in the Solar System, compared to the much larger gravitational 
force exerted by the Sun. 

Incidentally, there is something rather curious about Equation (5.1). According to this
law, the gravitational force acting on an object is directly proportional to its inertial mass. 
But why should inertia be related to the force of gravity? After all, inertia measures the re- 
luctance of a given body to deviate from its preferred state of uniform motion in a straight- 
line, in response to some external force. What has this got to do with gravitational at- 
traction? This question perplexed physicists for many years, and was only answered when 
Albert Einstein published his general theory of relativity in 1916. According to Einstein, 
inertial mass acts as a sort of gravitational charge since it is impossible to distinguish an
acceleration produced by a gravitational field from an apparent acceleration generated by 
observing in a non-inertial reference frame. The assumption that these two types of ac- 
celeration are indistinguishable leads directly to all of the strange predictions of general 
relativity: e.g., clocks in different gravitational potentials run at different rates, mass bends 
space, etc. 

According to Equation (5.1), and Newton's second law, the equation of motion of our 
planet takes the form 

$$ \frac{d^2\mathbf{r}}{dt^2} = -\frac{G\,M}{r^3}\,\mathbf{r}. \tag{5.3} $$

Note that the planetary mass, m, has canceled out on both sides of the above equation.



5.4 Conservation Laws 

Now gravity is a conservative force. Hence, the gravitational force (5.1) can be written (see 
Section 2.5) 

$$ \mathbf{f} = -\nabla U, \tag{5.4} $$

where the potential energy, U(r), of our planet in the Sun's gravitational field takes the
form

$$ U(r) = -\frac{G\,M\,m}{r}. \tag{5.5} $$

It follows that the total energy of our planet is a conserved quantity (see Section 2.5). In
other words,

$$ \mathcal{E} = \frac{v^2}{2} - \frac{G\,M}{r} \tag{5.6} $$

is constant in time. Here, $\mathcal{E}$ is actually the planet's total energy per unit mass, and
$\mathbf{v} = d\mathbf{r}/dt$.

Figure 5.1: Polar coordinates.



Gravity is also a central force. Hence, the angular momentum of our planet is a conserved
quantity (see Section 2.6). In other words,

$$ \mathbf{h} = \mathbf{r} \times \mathbf{v}, \tag{5.7} $$

which is actually the planet's angular momentum per unit mass, is constant in time. Taking
the scalar product of the above equation with $\mathbf{r}$, we obtain

$$ \mathbf{h} \cdot \mathbf{r} = 0. \tag{5.8} $$

This is the equation of a plane which passes through the origin, and whose normal is
parallel to $\mathbf{h}$. Since $\mathbf{h}$ is a constant vector, it always points in the same direction. We,
therefore, conclude that the motion of our planet is two-dimensional in nature: i.e., it is
confined to some fixed plane which passes through the origin. Without loss of generality,
we can let this plane coincide with the x-y plane.

5.5 Polar Coordinates

We can determine the instantaneous position of our planet in the x-y plane in terms of
standard Cartesian coordinates, (x, y), or polar coordinates, (r, θ), as illustrated in
Figure 5.1. Here, $r = \sqrt{x^2 + y^2}$ and $\theta = \tan^{-1}(y/x)$. It is helpful to define two unit vectors,
$\mathbf{e}_r = \mathbf{r}/r$ and $\mathbf{e}_\theta = \mathbf{e}_z \times \mathbf{e}_r$, at the instantaneous position of the planet. The first always
points radially away from the origin, whereas the second is normal to the first, in the
direction of increasing θ. As is easily demonstrated, the Cartesian components of $\mathbf{e}_r$ and $\mathbf{e}_\theta$
are

$$ \mathbf{e}_r = (\cos\theta,\ \sin\theta), \tag{5.9} $$
$$ \mathbf{e}_\theta = (-\sin\theta,\ \cos\theta), \tag{5.10} $$

respectively.

We can write the position vector of our planet as 

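$$ \mathbf{r} = r\,\mathbf{e}_r. \tag{5.11} $$

Thus, the planet's velocity becomes

$$ \mathbf{v} = \frac{d\mathbf{r}}{dt} = \dot r\,\mathbf{e}_r + r\,\dot{\mathbf{e}}_r, \tag{5.12} $$

where an overdot is shorthand for d/dt. Note that $\mathbf{e}_r$ has a non-zero time-derivative (unlike a
Cartesian unit vector) because its direction changes as the planet moves around. As is easily
demonstrated, from differentiating Equation (5.9) with respect to time,

$$ \dot{\mathbf{e}}_r = \dot\theta\,(-\sin\theta,\ \cos\theta) = \dot\theta\,\mathbf{e}_\theta. \tag{5.13} $$

Thus,

$$ \mathbf{v} = \dot r\,\mathbf{e}_r + r\,\dot\theta\,\mathbf{e}_\theta. \tag{5.14} $$

Now, the planet's acceleration is written

$$ \mathbf{a} = \frac{d\mathbf{v}}{dt} = \frac{d^2\mathbf{r}}{dt^2} = \ddot r\,\mathbf{e}_r + \dot r\,\dot{\mathbf{e}}_r + (\dot r\,\dot\theta + r\,\ddot\theta)\,\mathbf{e}_\theta + r\,\dot\theta\,\dot{\mathbf{e}}_\theta. \tag{5.15} $$

Again, $\mathbf{e}_\theta$ has a non-zero time-derivative because its direction changes as the planet moves
around. Differentiation of Equation (5.10) with respect to time yields

$$ \dot{\mathbf{e}}_\theta = \dot\theta\,(-\cos\theta,\ -\sin\theta) = -\dot\theta\,\mathbf{e}_r. \tag{5.16} $$

Hence,

$$ \mathbf{a} = (\ddot r - r\,\dot\theta^2)\,\mathbf{e}_r + (r\,\ddot\theta + 2\,\dot r\,\dot\theta)\,\mathbf{e}_\theta. \tag{5.17} $$

It follows that the equation of motion of our planet, (5.3), can be written

$$ \mathbf{a} = (\ddot r - r\,\dot\theta^2)\,\mathbf{e}_r + (r\,\ddot\theta + 2\,\dot r\,\dot\theta)\,\mathbf{e}_\theta = -\frac{G\,M}{r^2}\,\mathbf{e}_r. \tag{5.18} $$

Since $\mathbf{e}_r$ and $\mathbf{e}_\theta$ are mutually orthogonal, we can separately equate the coefficients of both,
in the above equation, to give a radial equation of motion,

$$ \ddot r - r\,\dot\theta^2 = -\frac{G\,M}{r^2}, \tag{5.19} $$

and a tangential equation of motion,

$$ r\,\ddot\theta + 2\,\dot r\,\dot\theta = 0. \tag{5.20} $$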
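The radial and tangential equations (5.19) and (5.20) can be checked numerically. The following is a minimal sketch, not part of the original text: it assumes units in which GM = 1, uses a classical fourth-order Runge-Kutta step, and the names `derivs`, `rk4_step`, `invariants`, and `integrate` are illustrative. It evaluates the angular momentum per unit mass, $r^2\dot\theta$, and the energy per unit mass of Equation (5.6), both of which should stay constant along the computed orbit.

```python
import math

GM = 1.0  # G*M in arbitrary units (illustrative assumption)


def derivs(state):
    """Eqs. (5.19)-(5.20) written as a first-order system."""
    r, theta, r_dot, theta_dot = state
    r_ddot = r * theta_dot**2 - GM / r**2       # radial equation (5.19)
    theta_ddot = -2.0 * r_dot * theta_dot / r   # tangential equation (5.20)
    return (r_dot, theta_dot, r_ddot, theta_ddot)


def rk4_step(state, dt):
    """Advance (r, theta, r_dot, theta_dot) by one Runge-Kutta step."""
    def nudge(k, scale):
        return tuple(s + scale * ki for s, ki in zip(state, k))
    k1 = derivs(state)
    k2 = derivs(nudge(k1, dt / 2))
    k3 = derivs(nudge(k2, dt / 2))
    k4 = derivs(nudge(k3, dt))
    return tuple(s + dt / 6 * (a + 2 * b + 2 * c + d)
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))


def invariants(state):
    """Return (h, energy per unit mass) using Eqs. (5.6) and (5.14)."""
    r, _, r_dot, theta_dot = state
    h = r**2 * theta_dot
    energy = 0.5 * (r_dot**2 + (r * theta_dot)**2) - GM / r
    return h, energy


def integrate(state, dt, steps):
    """Apply `steps` Runge-Kutta steps of size dt."""
    for _ in range(steps):
        state = rk4_step(state, dt)
    return state
```

Calling `invariants` before and after `integrate((1.0, 0.0, 0.0, 1.2), 1e-3, 20000)` gives a quick check that both quantities drift only at the level of the integration error.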

5.6 Conic Sections 

The ellipse, the parabola, and the hyperbola are collectively known as conic sections, since 
these three types of curve can be obtained by taking various different plane sections of a 
right cone. It turns out that the possible solutions of Equations (5.19) and (5.20) are all 
conic sections. It is, therefore, appropriate for us to briefly review these curves. 

An ellipse, centered on the origin, of major radius a and minor radius b, which are
aligned along the x- and y-axes, respectively (see Figure 5.2), satisfies the following
well-known equation:

$$ \frac{x^2}{a^2} + \frac{y^2}{b^2} = 1. \tag{5.21} $$

Likewise, a parabola which is aligned along the +x-axis, and passes through the origin
(see Figure 5.3), satisfies:

$$ y^2 - b\,x = 0, \tag{5.22} $$

where b > 0.

Finally, a hyperbola which is aligned along the +x-axis, and whose asymptotes intersect
at the origin (see Figure 5.4), satisfies:

$$ \frac{x^2}{a^2} - \frac{y^2}{b^2} = 1. \tag{5.23} $$

Here, a is the distance of closest approach to the origin. The asymptotes subtend an angle
$\phi = \tan^{-1}(b/a)$ with the x-axis.

Figure 5.3: A parabola.

Figure 5.4: A hyperbola.

It is not clear, at this stage, what the ellipse, the parabola, and the hyperbola have
in common (other than being conic sections). Well, it turns out that what these three
curves have in common is that they can all be represented as the locus of a movable
point whose distance from a fixed point is in a constant ratio to its perpendicular
distance to some fixed straight-line. Let the fixed point (which is termed the focus of the
ellipse/parabola/hyperbola) lie at the origin, and let the fixed line correspond to $x = -d$
(with d > 0). Thus, the distance of a general point (x, y) (which lies to the right of the
line $x = -d$) from the origin is $r_1 = \sqrt{x^2 + y^2}$, whereas the perpendicular distance of the
point from the line $x = -d$ is $r_2 = x + d$ (see Figure 5.5). In polar coordinates, $r_1 = r$ and
$r_2 = r\cos\theta + d$. Hence, the locus of a point for which $r_1$ and $r_2$ are in a fixed ratio satisfies
the following equation:

$$ \frac{r_1}{r_2} = \frac{\sqrt{x^2 + y^2}}{x + d} = \frac{r}{r\cos\theta + d} = e, \tag{5.24} $$

where e ≥ 0 is a constant. When expressed in terms of polar coordinates, the above
equation can be rearranged to give

$$ r = \frac{r_c}{1 - e\cos\theta}, \tag{5.25} $$

where $r_c = e\,d$.

Figure 5.5: Conic sections in polar coordinates.

When written in terms of Cartesian coordinates, (5.24) can be rearranged to give

$$ \frac{(x - x_c)^2}{a^2} + \frac{y^2}{b^2} = 1, \tag{5.26} $$

for e < 1. Here,

$$ a = \frac{r_c}{1 - e^2}, \tag{5.27} $$
$$ b = \frac{r_c}{\sqrt{1 - e^2}} = \sqrt{1 - e^2}\,a, \tag{5.28} $$
$$ x_c = \frac{e\,r_c}{1 - e^2} = e\,a. \tag{5.29} $$

Equation (5.26) can be recognized as the equation of an ellipse whose center lies at $(x_c, 0)$,
and whose major and minor radii, a and b, are aligned along the x- and y-axes, respectively
[cf. Equation (5.21)].

When again written in terms of Cartesian coordinates, Equation (5.24) can be rearranged
to give

$$ y^2 - 2\,r_c\,(x - x_c) = 0, \tag{5.30} $$

for e = 1. Here, $x_c = -r_c/2$. This is the equation of a parabola which passes through the
point $(x_c, 0)$, and which is aligned along the +x-direction [cf. Equation (5.22)].

Finally, when written in terms of Cartesian coordinates, Equation (5.24) can be rearranged
to give

$$ \frac{(x - x_c)^2}{a^2} - \frac{y^2}{b^2} = 1, \tag{5.31} $$

for e > 1. Here,

$$ a = \frac{r_c}{e^2 - 1}, \tag{5.32} $$
$$ b = \frac{r_c}{\sqrt{e^2 - 1}} = \sqrt{e^2 - 1}\,a, \tag{5.33} $$
$$ x_c = -\frac{e\,r_c}{e^2 - 1} = -e\,a. \tag{5.34} $$

Equation (5.31) can be recognized as the equation of a hyperbola whose asymptotes intersect
at $(x_c, 0)$, and which is aligned along the +x-direction. The asymptotes subtend an
angle

$$ \phi = \tan^{-1}\!\left(\frac{b}{a}\right) = \tan^{-1}\!\left(\sqrt{e^2 - 1}\right) \tag{5.35} $$

with the x-axis [cf. Equation (5.23)].

In conclusion, Equation (5.25) is the polar equation of a general conic section which is 
confocal with the origin. For e < 1, the conic section is an ellipse. For e = 1, the conic 
section is a parabola. Finally, for e > 1 , the conic section is a hyperbola. 
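The classification above, and the focus-directrix property (5.24) from which Equation (5.25) was derived, can be illustrated with a short sketch. The function names below are illustrative, and `distance_ratio` assumes e > 0 so that the directrix distance $d = r_c/e$ is finite.

```python
import math


def conic_type(e):
    """Classify the conic section of Eq. (5.25) by its constant e."""
    if e < 0:
        raise ValueError("e must be non-negative")
    if e < 1:
        return "ellipse"
    if e == 1:
        return "parabola"
    return "hyperbola"


def conic_radius(theta, r_c, e):
    """Polar equation (5.25): r = r_c / (1 - e cos(theta))."""
    return r_c / (1.0 - e * math.cos(theta))


def distance_ratio(theta, r_c, e):
    """Ratio r_1/r_2 of Eq. (5.24) for the point of the conic at angle theta."""
    d = r_c / e                      # since r_c = e d
    r = conic_radius(theta, r_c, e)
    x, y = r * math.cos(theta), r * math.sin(theta)
    return math.hypot(x, y) / (x + d)
```

For any angle at which the denominator of (5.25) is positive, `distance_ratio` should return e, whichever of the three curve types is selected.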

5.7 Kepler's Second Law 

Multiplying our planet's tangential equation of motion, (5.20), by r, we obtain 

$$ r^2\,\ddot\theta + 2\,r\,\dot r\,\dot\theta = 0. \tag{5.36} $$

However, the above equation can be also written

$$ \frac{d(r^2\,\dot\theta)}{dt} = 0, \tag{5.37} $$

which implies that

$$ h = r^2\,\dot\theta \tag{5.38} $$

is constant in time. It is easily demonstrated that h is the magnitude of the vector $\mathbf{h}$ defined
in Equation (5.7). Thus, the fact that h is constant in time is equivalent to the statement
that the angular momentum of our planet is a constant of its motion. As we have already
mentioned, this is the case because gravity is a central force.

Suppose that the radius vector connecting our planet to the origin (i.e., the Sun) sweeps
out an angle δθ between times t and t + δt (see Figure 5.6). The approximately triangular
region swept out by the radius vector has the area

$$ \delta A \simeq \frac{1}{2}\,r^2\,\delta\theta, \tag{5.39} $$

since the area of a triangle is half its base (r δθ) times its height (r). Hence, the rate at
which the radius vector sweeps out area is

$$ \frac{dA}{dt} = \lim_{\delta t \to 0} \frac{r^2\,\delta\theta}{2\,\delta t} = \frac{r^2}{2}\,\frac{d\theta}{dt} = \frac{h}{2}. \tag{5.40} $$

Thus, the radius vector sweeps out area at a constant rate (since h is constant in time) — 
this is Kepler's second law. We conclude that Kepler's second law of planetary motion is a 
direct consequence of angular momentum conservation. 



5.8 Kepler's First Law 

Our planet's radial equation of motion, (5.19), can be combined with Equation (5.38) to
give

$$ \ddot r - \frac{h^2}{r^3} = -\frac{G\,M}{r^2}. \tag{5.41} $$

Suppose that $r = u^{-1}$. It follows that

$$ \dot r = -\frac{\dot u}{u^2} = -r^2\,\frac{du}{d\theta}\,\frac{d\theta}{dt} = -h\,\frac{du}{d\theta}. \tag{5.42} $$

Likewise,

$$ \ddot r = -h\,\frac{d^2u}{d\theta^2}\,\dot\theta = -u^2\,h^2\,\frac{d^2u}{d\theta^2}. \tag{5.43} $$

Hence, Equation (5.41) can be written in the linear form

$$ \frac{d^2u}{d\theta^2} + u = \frac{G\,M}{h^2}. \tag{5.44} $$

The general solution to the above equation is

$$ u(\theta) = \frac{G\,M}{h^2}\,[1 - e\cos(\theta - \theta_0)], \tag{5.45} $$

where e and $\theta_0$ are arbitrary constants. Without loss of generality, we can set $\theta_0 = 0$ by
rotating our coordinate system about the z-axis. Thus, we obtain

$$ r(\theta) = \frac{r_c}{1 - e\cos\theta}, \tag{5.46} $$

where

$$ r_c = \frac{h^2}{G\,M}. \tag{5.47} $$

We immediately recognize Equation (5.46) as the equation of a conic section which is
confocal with the origin (i.e., with the Sun). Specifically, for e < 1, Equation (5.46) is the
equation of an ellipse which is confocal with the Sun. Thus, the orbit of our planet around
the Sun is a confocal ellipse. This is Kepler's first law of planetary motion. Of course, a
planet cannot have a parabolic or a hyperbolic orbit, since such orbits are only appropriate
to objects which are ultimately able to escape from the Sun's gravitational field.



5.9 Kepler's Third Law 

We have seen that the radius vector connecting our planet to the origin sweeps out area
at the constant rate dA/dt = h/2 [see Equation (5.40)]. We have also seen that the
planetary orbit is an ellipse. Suppose that the major and minor radii of the ellipse are a
and b, respectively. It follows that the area of the ellipse is A = π a b. Now, we expect
the radius vector to sweep out the whole area of the ellipse in a single orbital period, T.
Hence,

$$ T = \frac{A}{dA/dt} = \frac{2\pi\,a\,b}{h}. \tag{5.48} $$

It follows from Equations (5.27), (5.28), and (5.47) that

$$ T^2 = \frac{4\pi^2\,a^3}{G\,M}. \tag{5.49} $$

In other words, the square of the orbital period of our planet is proportional to the cube of
its orbital major radius. This is Kepler's third law.
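As a quick numerical illustration of Equation (5.49), the sketch below evaluates the orbital period for a given major radius. The solar mass and the astronomical unit used here are round reference values assumed for illustration; they do not come from the text, and the function name `orbital_period` is likewise illustrative.

```python
import math

G = 6.673e-11      # gravitational constant, Eq. (5.2), in m^3 kg^-1 s^-2
M_SUN = 1.989e30   # solar mass in kg (assumed reference value)
AU = 1.496e11      # mean Earth-Sun distance in m (assumed reference value)


def orbital_period(a, gm):
    """Kepler's third law, Eq. (5.49): T = 2 pi sqrt(a^3 / (G M))."""
    return 2.0 * math.pi * math.sqrt(a**3 / gm)


# Period of an orbit of major radius 1 AU, converted from seconds to days.
earth_period_days = orbital_period(AU, G * M_SUN) / 86400.0
```

With these inputs the result should come out close to one year, and quadrupling the major radius should lengthen the period by a factor of eight.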

Note that for an elliptical orbit the closest distance to the Sun (the so-called perihelion
distance) is [see Equations (5.27) and (5.46)]

$$ r_p = \frac{r_c}{1 + e} = a\,(1 - e). \tag{5.50} $$

Likewise, the furthest distance from the Sun (the so-called aphelion distance) is

$$ r_a = \frac{r_c}{1 - e} = a\,(1 + e). \tag{5.51} $$

It follows that the major radius, a, is simply the mean of the perihelion and aphelion
distances,

$$ a = \frac{r_p + r_a}{2}. \tag{5.52} $$

The parameter

$$ e = \frac{r_a - r_p}{r_a + r_p} \tag{5.53} $$

is called the eccentricity, and measures the deviation of the orbit from circularity. Thus,
e = 0 corresponds to a circular orbit, whereas e → 1 corresponds to an infinitely elongated
elliptical orbit.

As is easily demonstrated from the above analysis, Kepler's laws of planetary motion can
be written in the convenient form

$$ r = \frac{a\,(1 - e^2)}{1 - e\cos\theta}, \tag{5.54} $$
$$ r^2\,\dot\theta = (1 - e^2)^{1/2}\,n\,a^2, \tag{5.55} $$
$$ G\,M = n^2\,a^3, \tag{5.56} $$

where a is the mean orbital radius (i.e., the major radius), e the orbital eccentricity, and
n = 2π/T the mean orbital angular velocity.

5.10 Orbital Energies 

According to Equations (5.6) and (5.14), the total energy per unit mass of an object in
orbit around the Sun is given by

$$ \mathcal{E} = \frac{\dot r^2}{2} + \frac{r^2\,\dot\theta^2}{2} - \frac{G\,M}{r}. \tag{5.57} $$

It follows from Equations (5.38), (5.42), and (5.47) that

$$ \mathcal{E} = \frac{h^2}{2}\left[\left(\frac{du}{d\theta}\right)^2 + u^2 - 2\,u\,u_c\right], \tag{5.58} $$

where $u = r^{-1}$, and $u_c = r_c^{-1}$. However, according to Equation (5.46),

$$ u(\theta) = u_c\,(1 - e\cos\theta). \tag{5.59} $$

The previous two equations can be combined with Equations (5.47) and (5.50) to give

$$ \mathcal{E} = \frac{u_c^2\,h^2}{2}\,(e^2 - 1) = \frac{G\,M}{2\,r_p}\,(e - 1). \tag{5.60} $$

We conclude that elliptical orbits (e < 1) have negative total energies, whereas parabolic
orbits (e = 1) have zero total energies, and hyperbolic orbits (e > 1) have positive total
energies. This makes sense, since in a conservative system in which the potential energy
at infinity is set to zero [see Equation (5.5)] we expect bounded orbits to have negative
total energies, and unbounded orbits to have positive total energies (see Section 3.2). Thus,
elliptical orbits, which are clearly bounded, should indeed have negative total energies,
whereas hyperbolic orbits, which are clearly unbounded, should indeed have positive total
energies. Parabolic orbits are marginally bounded (i.e., an object executing a parabolic
orbit only just escapes from the Sun's gravitational field), and thus have zero total energy.
For the special case of an elliptical orbit, whose major radius a is finite, we can write

$$ \mathcal{E} = -\frac{G\,M}{2\,a}. \tag{5.61} $$

It follows that the energy of such an orbit is completely determined by the orbital major
radius.

Consider an artificial satellite in an elliptical orbit around the Sun (the same considerations
also apply to satellites in orbit around the Earth). At perihelion, $\dot r = 0$, and
Equations (5.57) and (5.60) can be combined to give

$$ \frac{v_t}{v_c} = \sqrt{1 + e}. \tag{5.62} $$

Here, $v_t = r\,\dot\theta$ is the satellite's tangential velocity, and $v_c = \sqrt{G\,M/r_p}$ is the tangential
velocity that it would need in order to maintain a circular orbit at the perihelion distance.
Likewise, at aphelion,

$$ \frac{v_t}{v_c} = \sqrt{1 - e}, \tag{5.63} $$

where $v_c = \sqrt{G\,M/r_a}$ is now the tangential velocity that the satellite would need in order
to maintain a circular orbit at the aphelion distance.

Suppose that our satellite is initially in a circular orbit of radius $r_1$, and that we wish
to transfer it into a circular orbit of radius $r_2$, where $r_2 > r_1$. We can achieve this by
temporarily placing the satellite in an elliptical orbit whose perihelion distance is $r_1$, and whose
aphelion distance is $r_2$. It follows, from Equation (5.53), that the required eccentricity of
the elliptical orbit is

$$ e = \frac{r_2 - r_1}{r_2 + r_1}. \tag{5.64} $$

According to Equation (5.62), we can transfer our satellite from its initial circular orbit
into the temporary elliptical orbit by increasing its tangential velocity (by briefly switching
on the satellite's rocket motor) by a factor

$$ \alpha_1 = \sqrt{1 + e}. \tag{5.65} $$

Figure 5.7: A transfer orbit between two circular orbits.

We must next allow the satellite to execute half an orbit, so that it attains its aphelion
distance, and then boost the tangential velocity by a factor [see Equation (5.63)]

$$ \alpha_2 = \frac{1}{\sqrt{1 - e}}. \tag{5.66} $$

The satellite will now be in a circular orbit at the aphelion distance, $r_2$. This process
is illustrated in Figure 5.7. Obviously, we can transfer our satellite from a larger to a
smaller circular orbit by performing the above process in reverse. Note, finally, from
Equation (5.62), that if we increase the tangential velocity of a satellite in a circular orbit about
the Sun by a factor greater than $\sqrt{2}$ then we will transfer it into a hyperbolic orbit (e > 1),
and it will eventually escape from the Sun's gravitational field.
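The two-burn transfer described above can be summarized in a few lines. This is a minimal sketch whose function name, `transfer_boosts`, is illustrative; it simply evaluates Equations (5.64) to (5.66) for given initial and final radii.

```python
import math


def transfer_boosts(r1, r2):
    """Eccentricity and velocity boost factors for a transfer from a
    circular orbit of radius r1 to a larger circular orbit of radius r2."""
    if not 0 < r1 < r2:
        raise ValueError("expected 0 < r1 < r2")
    e = (r2 - r1) / (r2 + r1)            # Eq. (5.64)
    alpha_1 = math.sqrt(1.0 + e)         # boost at perihelion, Eq. (5.65)
    alpha_2 = 1.0 / math.sqrt(1.0 - e)   # boost at aphelion, Eq. (5.66)
    return e, alpha_1, alpha_2
```

Both factors exceed unity for any $r_2 > r_1$, so both burns speed the satellite up, and $\alpha_1$ approaches the escape value $\sqrt{2}$ as $r_2/r_1$ becomes very large.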

5.11 Kepler Problem 

In a nutshell, the so-called Kepler problem consists of determining the radial and angular
coordinates, r and θ, respectively, of an object in a Keplerian orbit about the Sun as a
function of time.

Consider an object in a general Keplerian orbit about the Sun which passes through its
perihelion point, $r = r_p$ and θ = 0, at t = 0. It follows from the previous analysis that

$$ r = \frac{r_p\,(1 + e)}{1 + e\cos\theta}, \tag{5.67} $$

and

$$ \frac{\dot r^2}{2} + \frac{h^2}{2\,r^2} - \frac{G\,M}{r} = \mathcal{E}, \tag{5.68} $$

where e, $h = \sqrt{G\,M\,r_p\,(1 + e)}$, and $\mathcal{E} = G\,M\,(e - 1)/(2\,r_p)$ are the orbital eccentricity,
angular momentum per unit mass, and energy per unit mass, respectively. The above
equation can be rearranged to give

$$ \dot r^2 = (e - 1)\,\frac{G\,M}{r_p} - (e + 1)\,\frac{r_p\,G\,M}{r^2} + \frac{2\,G\,M}{r}. \tag{5.69} $$

Taking the square-root, and integrating, we obtain

$$ \int_{r_p}^{r} \frac{r\,dr}{[2\,r + (e - 1)\,r^2/r_p - (e + 1)\,r_p]^{1/2}} = \sqrt{G\,M}\;t. \tag{5.70} $$

Consider an elliptical orbit characterized by 0 < e < 1. Let us write

$$ r = \frac{r_p}{1 - e}\,(1 - e\cos E), \tag{5.71} $$

where E is termed the elliptic anomaly. In fact, E is an angle which varies between -π and
+π. Moreover, the perihelion point corresponds to E = 0, and the aphelion point to E = π.
Now,

$$ dr = \frac{r_p}{1 - e}\,e\sin E\,dE, \tag{5.72} $$

whereas

$$ 2\,r + (e - 1)\,\frac{r^2}{r_p} - (e + 1)\,r_p = \frac{r_p}{1 - e}\,e^2\,(1 - \cos^2 E) = \frac{r_p}{1 - e}\,e^2 \sin^2 E. \tag{5.73} $$

Thus, Equation (5.70) reduces to

$$ \int_0^E (1 - e\cos E)\,dE = \left(\frac{G\,M}{a^3}\right)^{1/2} t, \tag{5.74} $$

where $a = r_p/(1 - e)$. This equation can immediately be integrated to give

$$ E - e\sin E = 2\pi\left(\frac{t}{T}\right), \tag{5.75} $$

where $T = 2\pi\,(a^3/G\,M)^{1/2}$ is the orbital period. Equation (5.75), which is known as Kepler's
equation, is a transcendental equation which does not possess a simple analytic solution.
Fortunately, it is fairly straightforward to solve numerically. For instance, using an iterative
approach, if $E_n$ is the nth guess then

$$ E_{n+1} = 2\pi\left(\frac{t}{T}\right) + e\sin E_n. \tag{5.76} $$

The above iteration scheme converges very rapidly (except in the limit as e → 1).
Equations (5.67) and (5.71) can be combined to give

$$ \cos\theta = \frac{\cos E - e}{1 - e\cos E}. \tag{5.77} $$

Thus,

$$ 1 + \cos\theta = 2\cos^2(\theta/2) = \frac{2\,(1 - e)\cos^2(E/2)}{1 - e\cos E}, \tag{5.78} $$

and

$$ 1 - \cos\theta = 2\sin^2(\theta/2) = \frac{2\,(1 + e)\sin^2(E/2)}{1 - e\cos E}. \tag{5.79} $$

The previous two equations imply that

$$ \tan(\theta/2) = \left(\frac{1 + e}{1 - e}\right)^{1/2} \tan(E/2). \tag{5.80} $$

We conclude that, in the case of an elliptical orbit, the solution of the Kepler problem
reduces to the solution of the following three equations:

$$ E - e\sin E = 2\pi\left(\frac{t}{T}\right), \tag{5.81} $$
$$ r = a\,(1 - e\cos E), \tag{5.82} $$
$$ \tan(\theta/2) = \left(\frac{1 + e}{1 - e}\right)^{1/2} \tan(E/2). \tag{5.83} $$

Here, $T = 2\pi\,(a^3/G\,M)^{1/2}$ and $a = r_p/(1 - e)$. Incidentally, it is clear that if t → t + T then
E → E + 2π, and θ → θ + 2π. In other words, the motion is periodic with period T.
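The three equations (5.81) to (5.83) translate directly into a short routine. The sketch below is one possible implementation, not the text's own: it uses the fixed-point iteration (5.76), starting from the assumed initial guess $E_0 = 2\pi t/T$, and the function names, tolerance, and iteration cap are illustrative choices. The angle θ is recovered with `atan2`, so it is returned in the range -π to +π.

```python
import math


def solve_kepler(t, T, e, tol=1e-12, max_iter=10000):
    """Solve Kepler's equation (5.81) for E by iterating Eq. (5.76)."""
    mean = 2.0 * math.pi * t / T
    E = mean                               # initial guess (an assumption)
    for _ in range(max_iter):
        E_next = mean + e * math.sin(E)    # Eq. (5.76)
        if abs(E_next - E) < tol:
            return E_next
        E = E_next
    raise RuntimeError("iteration did not converge")


def kepler_position(t, T, e, a):
    """Return (r, theta) at time t for an elliptical orbit (0 <= e < 1)."""
    E = solve_kepler(t, T, e)
    r = a * (1.0 - e * math.cos(E))                        # Eq. (5.82)
    theta = 2.0 * math.atan2(math.sqrt(1.0 + e) * math.sin(E / 2.0),
                             math.sqrt(1.0 - e) * math.cos(E / 2.0))  # Eq. (5.83)
    return r, theta
```

Because each iteration shrinks the error by a factor of at most e, the loop needs more passes as e approaches 1, in line with the remark following Equation (5.76); a Newton iteration is a common alternative in that regime.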

For the case of a parabolic orbit, characterized by e = 1, similar analysis to the above
yields:

$$ P + \frac{P^3}{3} = \left(\frac{G\,M}{2\,r_p^3}\right)^{1/2} t, \tag{5.84} $$
$$ r = r_p\,(1 + P^2), \tag{5.85} $$
$$ \tan(\theta/2) = P. \tag{5.86} $$

Here, P is termed the parabolic anomaly, and varies between -∞ and +∞, with the
perihelion point corresponding to P = 0. Note that Equation (5.84) is a cubic equation,
possessing a single real root, which can, in principle, be solved analytically. However, a
numerical solution is generally more convenient.

Finally, for the case of a hyperbolic orbit, characterized by e > 1, we obtain:

$$ e\sinh H - H = \left(\frac{G\,M}{a^3}\right)^{1/2} t, \tag{5.87} $$
$$ r = a\,(e\cosh H - 1), \tag{5.88} $$
$$ \tan(\theta/2) = \left(\frac{e + 1}{e - 1}\right)^{1/2} \tanh(H/2). \tag{5.89} $$

Here, H is termed the hyperbolic anomaly, and varies between -∞ and +∞, with the
perihelion point corresponding to H = 0. Moreover, $a = r_p/(e - 1)$. As in the elliptical
case, Equation (5.87) is a transcendental equation which is most easily solved numerically.



5.12 Motion in a General Central Force-Field 

Consider the motion of an object in a general (attractive) central force-field characterized
by the potential energy per unit mass function V(r). Since the force-field is central, it still
remains true that

$$ h = r^2\,\dot\theta \tag{5.90} $$

is a constant of the motion. As is easily demonstrated, Equation (5.44) generalizes to

$$ \frac{d^2u}{d\theta^2} + u = -\frac{1}{h^2}\,\frac{dV}{du}, \tag{5.91} $$

where $u = r^{-1}$.

Suppose, for instance, that we wish to find the potential V(r) which causes an object
to execute the spiral orbit

$$ r = r_0\,\theta^2. \tag{5.92} $$

Substitution of $u = (r_0\,\theta^2)^{-1}$ into Equation (5.91) yields

$$ \frac{dV}{du} = -h^2\,(6\,r_0\,u^2 + u). \tag{5.93} $$

Integrating, we obtain

$$ V(u) = -h^2\left(2\,r_0\,u^3 + \frac{u^2}{2}\right), \tag{5.94} $$

or

$$ V(r) = -h^2\left(\frac{2\,r_0}{r^3} + \frac{1}{2\,r^2}\right). \tag{5.95} $$

In other words, the spiral pattern (5.92) is obtained from a mixture of an inverse-square
and inverse-cube potential.
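The step from (5.92) to (5.93) can be checked numerically. The sketch below, whose function names are illustrative, estimates $d^2u/d\theta^2 + u$ for the spiral $u = (r_0\,\theta^2)^{-1}$ by a central difference and compares it with $6\,r_0\,u^2 + u$, which is what Equations (5.91) and (5.93) together require.

```python
def spiral_u(theta, r0):
    """u = 1/r for the spiral orbit r = r0 * theta**2 of Eq. (5.92)."""
    return 1.0 / (r0 * theta**2)


def orbit_lhs(theta, r0, d=1e-3):
    """Central-difference estimate of d^2u/dtheta^2 + u."""
    u_minus = spiral_u(theta - d, r0)
    u_mid = spiral_u(theta, r0)
    u_plus = spiral_u(theta + d, r0)
    return (u_plus - 2.0 * u_mid + u_minus) / d**2 + u_mid


def orbit_rhs(theta, r0):
    """-(1/h^2) dV/du evaluated from Eq. (5.93): 6 r0 u^2 + u."""
    u = spiral_u(theta, r0)
    return 6.0 * r0 * u**2 + u
```

The two functions should agree to within the truncation error of the finite difference at any θ away from zero.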






5.13 Motion in a Nearly Circular Orbit 

In principle, a circular orbit is a possible orbit for any attractive central force. However,
not all forces result in stable circular orbits. Let us now consider the stability of circular
orbits in a general central force-field. Equation (5.41) generalizes to

$$ \ddot r - \frac{h^2}{r^3} = f(r), \tag{5.96} $$

where f(r) is the radial force per unit mass. For a circular orbit, $\ddot r = 0$, and the above
equation reduces to

$$ -\frac{h^2}{r_c^3} = f(r_c), \tag{5.97} $$

where $r_c$ is the radius of the orbit.

Let us now consider small departures from circularity. Let

$$ x = r - r_c. \tag{5.98} $$

Equation (5.96) can be written

$$ \ddot x - \frac{h^2}{(r_c + x)^3} = f(r_c + x). \tag{5.99} $$

Expanding the two terms involving $r_c + x$ as power series in $x/r_c$, and keeping all terms up
to first order, we obtain

$$ \ddot x - \frac{h^2}{r_c^3}\left(1 - \frac{3\,x}{r_c}\right) = f(r_c) + f'(r_c)\,x, \tag{5.100} $$

where ' denotes a derivative. Making use of Equation (5.97), the above equation reduces
to

$$ \ddot x + \left[-\frac{3\,f(r_c)}{r_c} - f'(r_c)\right] x = 0. \tag{5.101} $$

If the term in square brackets is positive then we obtain a simple harmonic equation, which
we already know has bounded solutions: i.e., the orbit is stable to small perturbations. On
the other hand, if the term in square brackets is negative then we obtain an equation whose
solutions grow exponentially in time: i.e., the orbit is unstable to small perturbations. Thus,
the stability criterion for a circular orbit of radius $r_c$ in a central force-field characterized
by a radial force (per unit mass) function f(r) is

$$ f(r_c) + \frac{r_c}{3}\,f'(r_c) < 0. \tag{5.102} $$

For example, consider an attractive power-law force function of the form

$$ f(r) = -c\,r^n, \tag{5.103} $$

where c > 0. Substituting into the above stability criterion, we obtain

$$ -c\,r_c^n - \frac{c\,n}{3}\,r_c^n < 0, \tag{5.104} $$

or

$$ n > -3. \tag{5.105} $$

We conclude that circular orbits in attractive central force-fields which decay faster than
$r^{-3}$ are unstable. The case n = -3 is special, since the first-order terms in the expansion
of Equation (5.99) cancel out exactly, and it is necessary to retain the second-order terms.
Doing this, it is easily demonstrated that circular orbits are also unstable for inverse-cube
(n = -3) forces.
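The stability criterion (5.102) is easy to test for a given force law. The following sketch is illustrative only: the names `is_stable` and `power_law` are not from the text, and the derivative $f'(r_c)$ is estimated with a central difference of assumed step `dr`.

```python
def is_stable(f, r_c, dr=1e-6):
    """Criterion (5.102): f(r_c) + (r_c / 3) f'(r_c) < 0.

    f is the radial force per unit mass; f' is approximated numerically.
    """
    f_prime = (f(r_c + dr) - f(r_c - dr)) / (2.0 * dr)
    return f(r_c) + (r_c / 3.0) * f_prime < 0.0


def power_law(n, c=1.0):
    """Attractive power-law force f(r) = -c r^n of Eq. (5.103)."""
    return lambda r: -c * r**n
```

For instance, `is_stable(power_law(-2), 1.0)` tests the inverse-square law. The marginal case n = -3 is not decided reliably by this first-order test, because the left-hand side of (5.102) then vanishes and the outcome depends on rounding error.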

An apsis is a point in an orbit at which the radial distance, r, assumes either a
maximum or a minimum value. Thus, the perihelion and aphelion points are the apsides of
planetary orbits. The angle through which the radius vector rotates in going between two
consecutive apsides is called the apsidal angle. Thus, the apsidal angle for elliptical orbits
in an inverse-square force-field is π.

For the case of stable nearly circular orbits, we have seen that r oscillates sinusoidally
about its mean value, $r_c$. Indeed, it is clear from Equation (5.101) that the period of the
oscillation is

$$ T = \frac{2\pi}{[-3\,f(r_c)/r_c - f'(r_c)]^{1/2}}. \tag{5.106} $$

The apsidal angle is the amount by which θ increases in going between a maximum and
a minimum of r. The time taken to achieve this is clearly T/2. Now, $\dot\theta = h/r^2$, where h is
a constant of the motion, and r is almost constant. Thus, $\dot\theta$ is approximately constant. In
fact,

$$ \dot\theta \simeq \frac{h}{r_c^2} = \left[-\frac{f(r_c)}{r_c}\right]^{1/2}, \tag{5.107} $$

where use has been made of Equation (5.97). Thus, the apsidal angle, ψ, is given by

$$ \psi = \frac{T}{2}\,\dot\theta = \pi\left[3 + r_c\,\frac{f'(r_c)}{f(r_c)}\right]^{-1/2}. \tag{5.108} $$

For the case of attractive power-law central forces of the form $f(r) = -c\,r^n$, where
c > 0, the apsidal angle becomes

$$ \psi = \frac{\pi}{(3 + n)^{1/2}}. \tag{5.109} $$

Now, it should be clear that if an orbit is going to close on itself then the apsidal angle
needs to be a rational fraction of 2π. There are, in fact, only two small integer values of
the power-law index, n, for which this is the case. As we have seen, for an inverse-square
force law (i.e., n = -2), the apsidal angle is π. For a linear force law (i.e., n = 1), the
apsidal angle is π/2 (see Section 4.2). However, for quadratic (i.e., n = 2) or cubic (i.e.,
n = 3) force laws, the apsidal angle is an irrational fraction of 2π, which means that
non-circular orbits in such force-fields never close on themselves.
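Equations (5.108) and (5.109) can be compared numerically. In the sketch below, the function names are illustrative and $f'(r_c)$ is again estimated by a central difference of assumed step `dr`; for a power-law force the general expression should reduce to the closed form.

```python
import math


def apsidal_angle(f, r_c, dr=1e-6):
    """Eq. (5.108), with f'(r_c) estimated by a central difference."""
    f_prime = (f(r_c + dr) - f(r_c - dr)) / (2.0 * dr)
    arg = 3.0 + r_c * f_prime / f(r_c)
    if arg <= 0.0:
        raise ValueError("no stable nearly circular orbit at this radius")
    return math.pi / math.sqrt(arg)


def power_law_apsidal_angle(n):
    """Eq. (5.109) for f(r) = -c r^n, valid for n > -3."""
    if n <= -3:
        raise ValueError("requires n > -3")
    return math.pi / math.sqrt(3.0 + n)
```

Dividing the returned angle by 2π gives the fraction whose rationality decides whether the orbit closes on itself.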

Let us, finally, calculate the apsidal angle for a nearly circular orbit of radius $r_c$ in a
slightly modified (attractive) inverse-square force law of the form

$$ f(r) = -\frac{k}{r^2} - \frac{\epsilon}{r^4}, \tag{5.110} $$

where $\epsilon/(k\,r_c^2)$ is small. Substitution into Equation (5.108) yields

$$ \psi = \pi\left[3 + r_c\,\frac{2\,k\,r_c^{-3} + 4\,\epsilon\,r_c^{-5}}{-k\,r_c^{-2} - \epsilon\,r_c^{-4}}\right]^{-1/2} = \pi\left[3 - 2\,\frac{1 + 2\,\epsilon/(k\,r_c^2)}{1 + \epsilon/(k\,r_c^2)}\right]^{-1/2}. \tag{5.111} $$

Expanding to first-order in $\epsilon/(k\,r_c^2)$, we obtain

$$ \psi \simeq \pi\left(3 - 2\,[1 + \epsilon/(k\,r_c^2)]\right)^{-1/2} = \pi\,[1 - 2\,\epsilon/(k\,r_c^2)]^{-1/2} \simeq \pi\,[1 + \epsilon/(k\,r_c^2)]. \tag{5.112} $$

We conclude that if ε > 0 then the perihelion (or aphelion) of the orbit advances by an
angle $2\pi\,\epsilon/(k\,r_c^2)$ every rotation period, since each rotation period contains two apsidal
angles. It turns out that the general relativistic corrections to
Newtonian gravity give rise to a small $1/r^4$ modification (with ε > 0) to the Sun's
gravitational field. Hence, these corrections generate a small precession in the perihelion of each
planet orbiting the Sun. This effect is particularly large for Mercury (see Section 12.13).



5.14 Exercises 

5.1. Halley's comet has an orbital eccentricity of e = 0.967 and a perihelion distance of 55,000,000
miles. Find the orbital period, and the comet's speed at perihelion and aphelion.

5.2. A comet is first seen at a distance of d astronomical units (1 astronomical unit is the mean
Earth-Sun distance) from the Sun, and is traveling with a speed of q times the Earth's mean
speed. Show that the orbit of the comet is hyperbolic, parabolic, or elliptical, depending on
whether the quantity $q^2 d$ is greater than, equal to, or less than 2, respectively.

5.3. Consider a planet in a Keplerian orbit of major radius a and eccentricity e about the Sun.
Suppose that the eccentricity of the orbit is small (i.e., 0 < e ≪ 1), as is indeed the case for
all of the planets except Mercury and Pluto. Demonstrate that, to first-order in e, the orbit
can be approximated as a circle whose center is shifted a distance e a from the Sun, and that
the planet's angular motion appears uniform when viewed from a point (called the Equant)
which is shifted a distance 2 e a from the Sun, in the same direction as the center of the circle.
This theorem is the basis of the Ptolemaic model of planetary motion.

5.4. How long (in days) does it take the Sun-Earth radius vector to rotate through 90°, starting at
the perihelion point? How long does it take starting at the aphelion point? The period and
eccentricity of the Earth's orbit are T = 365.24 days, and e = 0.01673, respectively.

5.5. Solve the Kepler problem for a parabolic orbit to obtain Equations (5.84)-(5.86).

5.6. Solve the Kepler problem for a hyperbolic orbit to obtain Equations (5.87)-(5.89).

5.7. A comet is in a parabolic orbit lying in the plane of the Earth's orbit. Regarding the Earth's
orbit as a circle of radius a, show that the points where the comet intersects the Earth's orbit
are given by

$$ \cos\theta = -1 + \frac{2\,p}{a}, $$

where p is the perihelion distance of the comet, defined at θ = 0. Show that the time interval
that the comet remains inside the Earth's orbit is the fraction

$$ \frac{2^{1/2}}{3\pi}\left(\frac{2\,p}{a} + 1\right)\left(1 - \frac{p}{a}\right)^{1/2} $$

of a year, and that the maximum value of this time interval is 2/(3π) year, or about 11 weeks.

5.8. Prove that in the case of a central force varying inversely as the cube of the distance

$$ r^2 = A\,t^2 + B\,t + C, $$

where A, B, C are constants.

5.9. The orbit of a particle moving in a central field is a circle passing through the origin, namely
$r = r_0 \cos\theta$. Show that the force law is inverse-fifth power.

5.10. A particle moving in a central field describes a spiral orbit $r = r_0 \exp(k\,\theta)$. Show that the
force law is inverse-cube, and that θ varies logarithmically with t. Show that there are two
other possible types of orbit in this force-field, and give their equations.

5.11. A particle moves in a spiral orbit given by r = a θ. Suppose that θ increases linearly with t.
Is the force acting on the particle central in nature? If not, determine how θ would have to
vary with t in order to make the force central.

5.12. A particle moves in a circular orbit of radius $r_0$ in an attractive central force-field of the form
$f(r) = -c \exp(-r/a)/r^2$, where c > 0 and a > 0. Demonstrate that the orbit is only stable
provided that $r_0 < a$.

5.13. A particle moves in a circular orbit in an attractive central force-field of the form
$f(r) = -c\,r^{-3}$, where c > 0. Demonstrate that the orbit is unstable to small perturbations.

5.14. If the Solar System were embedded in a uniform dust cloud, what would the apsidal angle
of a planet be for motion in a nearly circular orbit? Express your answer in terms of the ratio
of the mass of dust contained in a sphere, centered on the Sun, whose radius is that of the
orbit, to the mass of the Sun. This model was once suggested as a possible explanation for
the advance of the perihelion of Mercury.

5.15. The potential energy per unit mass of a particle in the gravitational field of an oblate spheroid,
like the Earth, is



where r refers to distances in the equatorial plane, M is the Earth's mass, and ε = (2/5) R ΔR.
Here, R = 4000 mi is the Earth's equatorial radius, and ΔR = 13 mi the difference between the
equatorial and polar radii. Find the apsidal angle for a satellite moving in a nearly circular
orbit in the equatorial plane of the Earth.

6 Two-Body Dynamics



6.1 Introduction

This chapter examines the motion of dynamical systems consisting of two freely moving 
and mutually interacting point objects. 



6.2 Reduced Mass 

Suppose that our first object is of mass $m_1$, and is located at position vector $\mathbf{r}_1$. Likewise,
our second object is of mass $m_2$, and is located at position vector $\mathbf{r}_2$. Let the first object
exert a force $\mathbf{f}_{21}$ on the second. By Newton's third law, the second object exerts an equal
and opposite force, $\mathbf{f}_{12} = -\mathbf{f}_{21}$, on the first. Suppose that there are no other forces in the
problem. The equations of motion of our two objects are thus

$$ m_1\,\frac{d^2\mathbf{r}_1}{dt^2} = -\mathbf{f}, \tag{6.1} $$
$$ m_2\,\frac{d^2\mathbf{r}_2}{dt^2} = \mathbf{f}, \tag{6.2} $$

where $\mathbf{f} = \mathbf{f}_{21}$.

Now, the center of mass of our system is located at

$$ \mathbf{r}_{cm} = \frac{m_1\,\mathbf{r}_1 + m_2\,\mathbf{r}_2}{m_1 + m_2}. \tag{6.3} $$

Hence, we can write

$$ \mathbf{r}_1 = \mathbf{r}_{cm} - \frac{m_2}{m_1 + m_2}\,\mathbf{r}, \tag{6.4} $$
$$ \mathbf{r}_2 = \mathbf{r}_{cm} + \frac{m_1}{m_1 + m_2}\,\mathbf{r}, \tag{6.5} $$

where $\mathbf{r} = \mathbf{r}_2 - \mathbf{r}_1$. Substituting the above two equations into Equations (6.1) and (6.2),
and making use of the fact that the center of mass of an isolated system does not accelerate
(see Section 2.6), we find that both equations yield

$$ \mu\,\frac{d^2\mathbf{r}}{dt^2} = \mathbf{f}, \tag{6.6} $$

where

$$ \mu = \frac{m_1\,m_2}{m_1 + m_2} \tag{6.7} $$

is called the reduced mass. Hence, we have effectively converted our original two-body
problem into an equivalent one-body problem. In the equivalent problem, the force $\mathbf{f}$ is
the same as that acting on both objects in the original problem (modulo a minus sign).
However, the mass, μ, is different, and is less than either of $m_1$ or $m_2$ (which is why it is
called the "reduced" mass).
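The reduction to an equivalent one-body problem can be made concrete with a small sketch. The function names below are illustrative, and vectors are represented as plain tuples for simplicity; the code evaluates Equations (6.7), (6.4), and (6.5).

```python
def reduced_mass(m1, m2):
    """Reduced mass, Eq. (6.7)."""
    return m1 * m2 / (m1 + m2)


def individual_positions(r_cm, r, m1, m2):
    """Recover r_1 and r_2 from the center of mass position r_cm and the
    relative position r = r_2 - r_1, using Eqs. (6.4) and (6.5)."""
    total = m1 + m2
    r1 = tuple(c - (m2 / total) * x for c, x in zip(r_cm, r))
    r2 = tuple(c + (m1 / total) * x for c, x in zip(r_cm, r))
    return r1, r2
```

When one mass is much larger than the other, `reduced_mass` returns a value close to the smaller mass, which is why the one-body treatment of Chapter 5 works so well for a planet orbiting the Sun.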



6.3 Binary Star Systems 

Approximately half of the stars in our galaxy are members of so-called binary star systems. 
Such systems consist of two stars orbiting about their common center of mass. The distance 
separating the stars is always much less than the distance to the nearest neighbour star. 
Hence, a binary star system can be treated as a two-body dynamical system to a very good 
approximation. 

In a binary star system, the gravitational force which the first star exerts on the second is

$$ \mathbf{f} = -\frac{G\,m_1\,m_2}{r^3}\,\mathbf{r}, \tag{6.8} $$

where $\mathbf{r} = \mathbf{r}_2 - \mathbf{r}_1$. As we have seen, a two-body system can be reduced to an equivalent
one-body system whose equation of motion is of the form (6.6), where $\mu = m_1 m_2/(m_1 + m_2)$.
Hence, in this particular case, we can write

$$ \frac{m_1\,m_2}{m_1 + m_2}\,\frac{d^2\mathbf{r}}{dt^2} = -\frac{G\,m_1\,m_2}{r^3}\,\mathbf{r}, \tag{6.9} $$

which gives

$$ \frac{d^2\mathbf{r}}{dt^2} = -\frac{G\,M}{r^3}\,\mathbf{r}, \tag{6.10} $$

where

$$ M = m_1 + m_2. \tag{6.11} $$



Equation (6.10) is identical to Equation (5.3), which we have already solved. Hence,
we can immediately write down the solution:

$$ \mathbf{r} = (r\cos\theta,\; r\sin\theta,\; 0), \tag{6.12} $$

where

$$ r = \frac{a\,(1 - e^2)}{1 - e\cos\theta}, \tag{6.13} $$

and

$$ \frac{d\theta}{dt} = \frac{h}{r^2}, \tag{6.14} $$

with

$$ a = \frac{h^2}{(1 - e^2)\,G\,M}. \tag{6.15} $$

Figure 6.1: An example binary star orbit.

Here, $h$ is a constant, and we have aligned our Cartesian axes so that the plane of the orbit
coincides with the x-y plane. According to the above solution, the second star executes a
Keplerian elliptical orbit, with major radius $a$ and eccentricity $e$, relative to the first star,
and vice versa. From Equation (5.49), the period of revolution, $T$, is given by

$$ T = \sqrt{\frac{4\pi^2\,a^3}{G\,M}}. \tag{6.16} $$



In the inertial frame of reference whose origin always coincides with the center of
mass (the so-called center of mass frame), the position vectors of the two stars are

$$ \mathbf{r}_1 = -\frac{m_2}{m_1 + m_2}\,\mathbf{r}, \tag{6.17} $$

$$ \mathbf{r}_2 = \frac{m_1}{m_1 + m_2}\,\mathbf{r}, \tag{6.18} $$

where $\mathbf{r}$ is specified above. Figure 6.1 shows an example binary star orbit, in the center
of mass frame, calculated with $m_1/m_2 = 0.5$ and $e = 0.2$. Here, the triangles and squares
denote the positions of the first and second star, respectively (which are always
diametrically opposite one another, as indicated by the arrows). It can be seen that both stars
execute elliptical orbits about their common center of mass.
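
As an illustration, the positions of the two stars in the center of mass frame can be
evaluated from Equations (6.12), (6.13), (6.17), and (6.18). The Python sketch below is not
from the original text; the function name `binary_positions` and its argument order are
assumptions made for the example, and only the in-plane (x, y) components are returned.

```python
import math


def binary_positions(theta, a, e, m1, m2):
    """Positions (x, y) of both stars in the center of mass frame.

    theta is the orbital angle, a the major radius, and e the eccentricity
    of the relative orbit.
    """
    r = a * (1.0 - e**2) / (1.0 - e * math.cos(theta))   # Equation (6.13)
    rel = (r * math.cos(theta), r * math.sin(theta))     # Equation (6.12)
    r1 = tuple(-m2 / (m1 + m2) * c for c in rel)         # Equation (6.17)
    r2 = tuple(m1 / (m1 + m2) * c for c in rel)          # Equation (6.18)
    return r1, r2


if __name__ == "__main__":
    # Parameters quoted for Figure 6.1: m1/m2 = 0.5 and e = 0.2.
    for k in range(8):
        theta = 2.0 * math.pi * k / 8
        print(binary_positions(theta, 1.0, 0.2, 0.5, 1.0))
```

With this convention the mass-weighted sum of the two position vectors vanishes at every
angle, which is just the statement that the origin is the center of mass.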

Figure 6.2: A collision viewed in the center of mass frame.

Binary star systems have been very useful to astronomers, since it is possible to
determine the masses of both stars in such a system by careful observation. The sum of the
masses of the two stars, $M = m_1 + m_2$, can be found from Equation (6.16) after a
measurement of the major radius, $a$ (which is the mean of the greatest and smallest distance apart
of the two stars during their orbit), and the orbital period, $T$. The ratio of the masses of
the two stars, $m_1/m_2$, can be determined from Equations (6.17) and (6.18) by observing
the fixed ratio of the relative distances of the two stars from the common center of mass
about which they both appear to rotate. Obviously, given the sum of the masses, and the
ratio of the masses, the individual masses themselves can then be calculated.
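
The mass determination just described can be sketched in a few lines of Python. This is an
illustrative example rather than part of the original text: the function name
`binary_masses`, the SI value used for $G$, and the sample inputs are all assumptions.

```python
import math

G = 6.674e-11  # gravitational constant in SI units (assumed value)


def binary_masses(a, T, d1_over_d2):
    """Individual masses from the major radius a, the period T, and the ratio
    d1/d2 of the distances of star 1 and star 2 from the center of mass."""
    M = 4.0 * math.pi**2 * a**3 / (G * T**2)   # Equation (6.16) solved for M
    m1_over_m2 = 1.0 / d1_over_d2              # Equations (6.17)-(6.18): d1/d2 = m2/m1
    m2 = M / (1.0 + m1_over_m2)
    m1 = M - m2
    return m1, m2


if __name__ == "__main__":
    # Hypothetical binary: separation 1.496e11 m, period 3.156e7 s,
    # with star 1 twice as close to the center of mass as star 2.
    print(binary_masses(1.496e11, 3.156e7, 0.5))
```

The star that stays closer to the center of mass is the heavier one, so a distance ratio of
0.5 corresponds to $m_1 = 2\,m_2$.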

6.4 Scattering in the Center of Mass Frame 

Let us now consider scattering due to the collision of two particles. We shall restrict our 
discussion to particles which interact via conservative central forces. It turns out that scat- 
tering looks particularly simple when viewed in the center of mass frame. Let us, therefore, 
start our investigation by considering two-particle scattering in the center of mass frame. 

As before, the first particle is of mass $m_1$, and is located at position vector $\mathbf{r}_1$, whereas
the second particle is of mass $m_2$, and is located at $\mathbf{r}_2$. By definition, there is zero net linear
momentum in the center of mass frame at all times. Hence, if the first particle approaches
the collision point with momentum $\mathbf{p}$ then the second must approach with momentum
$-\mathbf{p}$. Likewise, after the collision, if the first particle recedes from the collision point with
momentum $\mathbf{p}'$ then the second must recede with momentum $-\mathbf{p}'$ (see Figure 6.2).
Furthermore, since the interaction force is conservative, the total kinetic energy before and
after the collision must be the same. It follows that the magnitude of the final momentum
vector, $p'$, is equal to the magnitude of the initial momentum vector, $p$. Because of this, the
collision event is completely specified once the angle through which the first particle is
scattered is given. Of course, in the center of mass frame, the second particle is scattered
through the same angle (see Figure 6.2).

Suppose that the two particles interact via the potential $U(r)$, where $r$ is the distance
separating the particles. As we have seen, the two-body problem sketched in Figure 6.2
can be converted into the equivalent one-body problem sketched in Figure 6.3. In this
equivalent problem, a particle of mass $\mu = m_1 m_2/(m_1 + m_2)$ is scattered in the fixed
potential $U(r)$, where $r$ is now the distance from the origin. The vector position $\mathbf{r}$ of the
particle in the equivalent problem corresponds to the relative position vector $\mathbf{r}_2 - \mathbf{r}_1$ in the
original problem. It follows that the angle through which the particle is scattered in the
equivalent problem is the same as the scattering angle in the original problem.

Figure 6.3: The one-body equivalent to the previous figure.

The scattering angle, $\theta$, is largely determined by the so-called impact parameter, $b$,
which is the distance of closest approach of the two particles in the absence of an interaction
potential. In the equivalent problem, $b$ is the distance of closest approach to the origin in
the absence of an interaction potential (see Figure 6.3). If $b = 0$ then we have a head-on
collision. In this case, we expect the two particles to reverse direction after colliding: i.e.,
we expect $\theta = \pi$. Likewise, if $b$ is large then we expect the two particles to miss one
another entirely, in which case $\theta = 0$. It follows that the scattering angle, $\theta$, is a decreasing
function of the impact parameter, $b$.

Suppose that the polar coordinates of the particle in the equivalent problem are $(r, \vartheta)$.
Let the particle approach the origin from the direction $\vartheta = 0$, and attain its closest distance
to the origin when $\vartheta = \Theta$. From symmetry, the angle $\alpha$ in Figure 6.3 is equal to the angle
$\beta$. However, from simple geometry, $\alpha = \Theta$. Hence,

$$ \theta = \pi - 2\,\Theta. \tag{6.19} $$

Now, by analogy with Equation (5.58), the conserved total energy $E$ in the equivalent
problem, which can easily be shown to be the same as the total energy in the original
problem, is given by

$$ E = \frac{\mu\,h^2}{2}\left[\left(\frac{du}{d\vartheta}\right)^2 + u^2\right] + U(u), \tag{6.20} $$

where $u = r^{-1}$, and $h$ is the angular momentum per unit mass in the equivalent problem.
It is easily seen that

$$ h = b\,v_\infty = b\,\sqrt{\frac{2E}{\mu}}, \tag{6.21} $$

where $v_\infty$ is the approach velocity in the equivalent problem at large $r$. It follows that

$$ E = E\,b^2\left[\left(\frac{du}{d\vartheta}\right)^2 + u^2\right] + U(u). \tag{6.22} $$

The above equation can be rearranged to give

$$ \frac{d\vartheta}{du} = \frac{b}{\sqrt{1 - b^2 u^2 - U(u)/E}}. \tag{6.23} $$

Integration yields

$$ \Theta = \int_0^{u_{max}} \frac{b\,du}{\sqrt{1 - b^2 u^2 - U(u)/E}}. \tag{6.24} $$

Here, $u_{max} = 1/r_{min}$, where $r_{min}$ is the distance of closest approach. Since, by symmetry,
$(du/d\vartheta)_{u_{max}} = 0$, it follows from Equation (6.23) that

$$ 1 - b^2 u_{max}^2 - U(u_{max})/E = 0. \tag{6.25} $$

Equations (6.19) and (6.24) enable us to calculate the function $b(\theta)$ for a given interaction
potential, $U(r)$, and a given total energy, $E$, of the two particles in the center of mass frame.
The function $b(\theta)$ tells us which impact parameter corresponds to which scattering angle,
and vice versa.

Instead of two particles, suppose that we now have two counter-propagating beams of
identical particles (with the same properties as the two particles described above) which
scatter one another via binary collisions. What is the angular distribution of the scattered
particles? Consider pairs of particles whose impact parameters lie in the range $b$ to $b + db$.
These particles are scattered in such a manner that their scattering angles lie in the range
$\theta$ to $\theta + d\theta$, where $\theta$ is determined from inverting the function $b(\theta)$, and

$$ d\theta = \frac{db}{|db(\theta)/d\theta|}. \tag{6.26} $$

Incidentally, we must take the modulus of $db(\theta)/d\theta$ because $b(\theta)$ is a decreasing function
of $\theta$. Assuming, as seems reasonable, that the scattering is azimuthally symmetric, the
range of solid angle into which the particles are scattered is

$$ d\Omega = 2\pi\,\sin\theta\,d\theta = \frac{2\pi\,\sin\theta\,db}{|db/d\theta|}. \tag{6.27} $$

Finally, the cross-sectional area of the annulus through which incoming particles must pass
if they are to have impact parameters in the range $b$ to $b + db$ is

$$ d\sigma = 2\pi\,b\,db. \tag{6.28} $$

The previous two equations allow us to define the differential scattering cross-section:

$$ \frac{d\sigma}{d\Omega} = \frac{b}{\sin\theta}\left|\frac{db}{d\theta}\right|. \tag{6.29} $$

The differential scattering cross-section has units of area per steradian, and specifies the
effective target area for scattering into a given range of solid angle. For two uniform
beams scattering off one another, the differential scattering cross-section thus effectively
specifies the probability of scattering into a given range of solid angle. The total scattering
cross-section is the integral of the differential cross-section over all solid angles,

$$ \sigma = \int \frac{d\sigma}{d\Omega}\,d\Omega, \tag{6.30} $$

and measures the effective target area for scattering in any direction. Thus, if the flux of
particles per unit area per unit time, otherwise known as the intensity, of the two beams is
$I$, then the number of particles of a given type scattered per unit time is simply $I\,\sigma$.

Let us now calculate the scattering cross-section for the following very simple
interaction potential:

$$ U(r) = \begin{cases} 0 & r > a \\ \infty & r \le a \end{cases}. \tag{6.31} $$

This is the interaction potential of impenetrable spheres which only exert a force on one
another when they are in physical contact (e.g., billiard balls). If the particles in the first
beam have radius $R_1$, and the particles in the second beam have radius $R_2$, then $a = R_1 + R_2$.
In other words, the centers of two particles, one from either beam, can never be less than
a distance $a$ apart, where $a$ is the sum of their radii (since the particles are impenetrable
spheres).

Equations (6.19), (6.24), and (6.31) yield

$$ \theta = \pi - 2\int_0^{1/a} \frac{b\,du}{\sqrt{1 - b^2 u^2}} = \pi - 2\,\sin^{-1}(b/a). \tag{6.32} $$

The above formula can be rearranged to give

$$ b(\theta) = a\,\cos(\theta/2). \tag{6.33} $$

Note that

$$ b\left|\frac{db}{d\theta}\right| = \frac{1}{2}\left|\frac{db^2}{d\theta}\right| = \frac{a^2}{2}\,\sin(\theta/2)\,\cos(\theta/2) = \frac{a^2}{4}\,\sin\theta. \tag{6.34} $$

Hence, Equations (6.29) and (6.34) yield

$$ \frac{d\sigma}{d\Omega} = \frac{a^2}{4}. \tag{6.35} $$

We thus conclude that when two beams of impenetrable spheres collide, in the center of
mass frame, the particles in the two beams have an equal probability of being scattered in
any direction. The total scattering cross-section is

$$ \sigma = \int \frac{d\sigma}{d\Omega}\,d\Omega = \pi\,a^2. \tag{6.36} $$

Obviously, this result makes a lot of sense: the total scattering cross-section for two
impenetrable spheres is simply the area of a circle whose radius is the sum of the radii of the
two spheres.
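
The hard-sphere results can be verified numerically. The Python sketch below is an
illustration that is not part of the original text; the function names and the
finite-difference step are assumptions. It evaluates Equation (6.29) with a numerical
derivative of $b(\theta)$ from Equation (6.33), and then integrates over solid angle as in
Equation (6.30).

```python
import math


def impact_parameter(theta, a):
    """Hard-sphere impact parameter, Equation (6.33)."""
    return a * math.cos(theta / 2.0)


def dsigma_domega(theta, a, h=1e-6):
    """Equation (6.29), using a central-difference estimate of db/dtheta."""
    db = (impact_parameter(theta + h, a) - impact_parameter(theta - h, a)) / (2.0 * h)
    return impact_parameter(theta, a) * abs(db) / math.sin(theta)


def total_cross_section(a, n=2000):
    """Equation (6.30): midpoint-rule integral over all solid angles."""
    dtheta = math.pi / n
    total = 0.0
    for i in range(n):
        theta = (i + 0.5) * dtheta
        total += dsigma_domega(theta, a) * 2.0 * math.pi * math.sin(theta) * dtheta
    return total


if __name__ == "__main__":
    a = 2.0
    print(dsigma_domega(1.0, a), a**2 / 4.0)          # compare with Equation (6.35)
    print(total_cross_section(a), math.pi * a**2)     # compare with Equation (6.36)
```

The numerical differential cross-section is independent of angle, and the integrated value
agrees with $\pi a^2$ to within the accuracy of the quadrature.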

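Let us now consider scattering by an inverse-square interaction force whose potential
takes the form

$$ U(r) = \frac{k}{r}. \tag{6.37} $$

It follows from Equations (6.24) and (6.25) that

$$ \Theta = \int_0^{u_{max}} \frac{b\,du}{\sqrt{1 - b^2 u^2 - k\,u/E}} = \int_0^{x_{max}} \frac{dx}{\sqrt{1 - x^2 - \alpha\,x}}, \tag{6.38} $$

where $\alpha = k/(E\,b)$, and

$$ 1 - x_{max}^2 - \alpha\,x_{max} = 0. \tag{6.39} $$

Integration yields

$$ \Theta = \frac{\pi}{2} - \sin^{-1}\left(\frac{\alpha}{\sqrt{4 + \alpha^2}}\right). \tag{6.40} $$

Hence, from Equation (6.19), we obtain

$$ \theta = 2\,\sin^{-1}\left(\frac{\alpha}{\sqrt{4 + \alpha^2}}\right). \tag{6.41} $$

The above equation can be rearranged to give

$$ b^2 = \frac{k^2}{4E^2}\,\cot^2(\theta/2). \tag{6.42} $$

Thus,

$$ 2\,b\left|\frac{db}{d\theta}\right| = \frac{k^2}{8E^2}\,\frac{\sin\theta}{\sin^4(\theta/2)}. \tag{6.43} $$

Finally, using Equation (6.29), we get

$$ \frac{d\sigma}{d\Omega} = \frac{k^2}{16\,E^2}\,\frac{1}{\sin^4(\theta/2)}. \tag{6.44} $$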

There are a number of things to note about the above formula. First, the scattering
cross-section is proportional to $k^2$. This means that repulsive ($k > 0$) and attractive ($k < 0$)
inverse-square interaction forces of the same strength give rise to identical angular
distributions of scattered particles. Second, the scattering cross-section is proportional to $E^{-2}$.
This means that inverse-square interaction forces are much more effective at scattering low
energy, rather than high energy, particles. Finally, the differential scattering cross-section is
proportional to $\sin^{-4}(\theta/2)$. This means that, with an inverse-square interaction force, the
overwhelming majority of "collisions" consist of small angle scattering events (i.e., $\theta \ll 1$).
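
The relation between impact parameter and scattering angle for the inverse-square force
can be checked by evaluating the integral in Equation (6.38) numerically. The Python sketch
below is illustrative only and is not from the original text. It assumes the substitution
$x = x_{max}(1 - s^2)$, which removes the inverse-square-root singularity at the upper limit
because $1 - x^2 - \alpha x = (x_{max} - x)(x + x_-)$, where $x_- = (\sqrt{\alpha^2 + 4} + \alpha)/2$.
The function names are likewise assumptions.

```python
import math


def theta_numeric(alpha, n=20000):
    """Scattering angle from Equation (6.19), with the integral in
    Equation (6.38) evaluated by the midpoint rule in the variable s."""
    root = math.sqrt(alpha**2 + 4.0)
    x_max = (root - alpha) / 2.0    # positive root of Equation (6.39)
    x_neg = (root + alpha) / 2.0    # minus the other root of the quadratic
    ds = 1.0 / n
    big_theta = 0.0
    for i in range(n):
        s = (i + 0.5) * ds
        x = x_max * (1.0 - s * s)
        # dx / sqrt(1 - x^2 - alpha x) becomes 2 sqrt(x_max) ds / sqrt(x + x_neg)
        big_theta += 2.0 * math.sqrt(x_max) / math.sqrt(x + x_neg) * ds
    return math.pi - 2.0 * big_theta


def theta_formula(alpha):
    """Closed-form scattering angle, Equation (6.41)."""
    return 2.0 * math.asin(alpha / math.sqrt(4.0 + alpha**2))


if __name__ == "__main__":
    for alpha in (0.1, 0.5, 2.0, 10.0):
        print(alpha, theta_numeric(alpha), theta_formula(alpha))
```

Here $\alpha = k/(E\,b)$, so small $\alpha$ (large impact parameter or high energy) corresponds to
small-angle scattering, in line with the remarks above.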

Let us now consider a specific case. Suppose that we have particles of electric charge $q$
scattering off particles of the same charge. The interaction potential due to the Coulomb
force between the particles is simply

$$ U(r) = \frac{q^2}{4\pi\,\epsilon_0\,r}. \tag{6.45} $$

Thus, it follows from Equation (6.44) [with $k = q^2/(4\pi\,\epsilon_0)$] that the differential scattering
cross-section takes the form

$$ \frac{d\sigma}{d\Omega} = \frac{q^4}{16\,(4\pi\,\epsilon_0)^2\,E^2}\,\frac{1}{\sin^4(\theta/2)}. \tag{6.46} $$

This very famous formula is known as the Rutherford scattering cross-section, since it was
first derived by Ernest Rutherford for use in his celebrated $\alpha$-particle scattering experiment.

Note, finally, that if we try to integrate the Rutherford formula to obtain the total
scattering cross-section then we find that the integral is divergent, due to the very strong
increase in $d\sigma/d\Omega$ as $\theta \rightarrow 0$. This implies that the Coulomb potential (or any other
potential that gives rise to an inverse-square-law force) has an effectively infinite range. In
practice, however, an electric charge is generally surrounded by charges of the opposite sign
which shield the Coulomb potential of the charge beyond a certain distance. This shielding
effect allows the charge to have a finite total scattering cross-section (for the scattering of
other electric charges). However, the total scattering cross-section of the charge depends
(albeit, logarithmically) on the shielding distance, and, hence, on the nature and distribution
of the charges surrounding it.
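
The divergence of the unshielded cross-section can be made concrete by integrating
Equation (6.44) only over scattering angles larger than some cut-off $\theta_{min}$. The Python
sketch below is illustrative and not part of the original text; the function names and the
cut-off values are assumptions. Carrying out the integral analytically gives
$\pi\,(k/2E)^2\cot^2(\theta_{min}/2)$, which by Equation (6.42) is just $\pi\,b^2(\theta_{min})$.

```python
import math


def rutherford_dsigma(theta, k, E):
    """Differential cross-section for an inverse-square force, Equation (6.44)."""
    return k**2 / (16.0 * E**2 * math.sin(theta / 2.0)**4)


def sigma_beyond_numeric(theta_min, k, E, n=20000):
    """Midpoint-rule integral of Equation (6.44) over theta_min < theta < pi."""
    dtheta = (math.pi - theta_min) / n
    total = 0.0
    for i in range(n):
        theta = theta_min + (i + 0.5) * dtheta
        total += rutherford_dsigma(theta, k, E) * 2.0 * math.pi * math.sin(theta) * dtheta
    return total


def sigma_beyond_exact(theta_min, k, E):
    """Closed form of the same integral: pi * b(theta_min)^2, via Equation (6.42)."""
    return math.pi * (k / (2.0 * E))**2 / math.tan(theta_min / 2.0)**2


if __name__ == "__main__":
    for theta_min in (1.0, 0.1, 0.01):
        print(theta_min, sigma_beyond_exact(theta_min, 1.0, 1.0))
```

Each tenfold reduction of the cut-off angle increases the result by roughly a factor of one
hundred, so no finite limit exists as $\theta_{min} \rightarrow 0$.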



6.5 Scattering in the Laboratory Frame 

We have seen that two-particle scattering looks fairly simple when viewed in the center of 
mass frame. Unfortunately, we are not usually in a position to do this. In the laboratory, 
the most common scattering scenario is one in which the second particle is initially at rest. 
Let us now investigate this situation. 

Suppose that, in the center of mass frame, the first particle has velocity $\mathbf{v}_1$ before the
collision, and velocity $\mathbf{v}_1'$ after the collision. Likewise, the second particle has velocity $\mathbf{v}_2$
before the collision, and $\mathbf{v}_2'$ after the collision. We know that

$$ m_1\,\mathbf{v}_1 + m_2\,\mathbf{v}_2 = m_1\,\mathbf{v}_1' + m_2\,\mathbf{v}_2' = \mathbf{0} \tag{6.47} $$

in the center of mass frame. Moreover, since the collision is assumed to be elastic (i.e.,
energy conserving),

$$ v_1' = v_1, \tag{6.48} $$

$$ v_2' = v_2. \tag{6.49} $$

Let us transform to a new inertial frame of reference, which we shall call the laboratory
frame, and which is moving with the uniform velocity $\mathbf{v}_2$ with respect to the center of mass
frame. In the new reference frame, the first particle has initial velocity $\mathbf{V}_1 = \mathbf{v}_1 - \mathbf{v}_2$, and
final velocity $\mathbf{V}_1' = \mathbf{v}_1' - \mathbf{v}_2$. Furthermore, the second particle is initially at rest, and has the
final velocity $\mathbf{V}_2' = \mathbf{v}_2' - \mathbf{v}_2$. The relationship between scattering in the center of mass frame
and scattering in the laboratory frame is illustrated in Figure 6.4.

Figure 6.4: Scattering in the center of mass and laboratory frames.

In the center of mass frame, both particles are scattered through the same angle $\theta$.
However, in the laboratory frame, the first and second particles are scattered by the
(generally different) angles $\psi$ and $\zeta$, respectively.

Defining x- and y-axes, as indicated in Figure 6.4, it is easily seen that the Cartesian
components of the various velocity vectors in the two frames of reference are:

$$ \mathbf{v}_1 = v_1\,(1,\; 0), \tag{6.50} $$

$$ \mathbf{v}_2 = (m_1/m_2)\,v_1\,(-1,\; 0), \tag{6.51} $$

$$ \mathbf{v}_1' = v_1\,(\cos\theta,\; \sin\theta), \tag{6.52} $$

$$ \mathbf{v}_2' = (m_1/m_2)\,v_1\,(-\cos\theta,\; -\sin\theta), \tag{6.53} $$

$$ \mathbf{V}_1 = (1 + m_1/m_2)\,v_1\,(1,\; 0), \tag{6.54} $$

$$ \mathbf{V}_1' = v_1\,(\cos\theta + m_1/m_2,\; \sin\theta), \tag{6.55} $$

$$ \mathbf{V}_2' = (m_1/m_2)\,v_1\,(1 - \cos\theta,\; -\sin\theta). \tag{6.56} $$

In the center of mass frame, let $E$ be the total energy, let $E_1 = (1/2)\,m_1 v_1^2$ and
$E_2 = (1/2)\,m_2 v_2^2$ be the kinetic energies of the first and second particles, respectively, before the
collision, and let $E_1' = (1/2)\,m_1 v_1'^2$ and $E_2' = (1/2)\,m_2 v_2'^2$ be the kinetic energies of the first
and second particles, respectively, after the collision. Of course, $E = E_1 + E_2 = E_1' + E_2'$.
In the laboratory frame, let $\mathcal{E}$ be the total energy. This is, of course, equal to the kinetic
energy of the first particle before the collision. Likewise, let $\mathcal{E}_1' = (1/2)\,m_1 V_1'^2$ and
$\mathcal{E}_2' = (1/2)\,m_2 V_2'^2$ be the kinetic energies of the first and second particles, respectively, after the
collision. Of course, $\mathcal{E} = \mathcal{E}_1' + \mathcal{E}_2'$.

The following results can easily be obtained from the above definitions and
Equations (6.50)-(6.56). First,

$$ \mathcal{E} = \left(\frac{m_1 + m_2}{m_2}\right) E. \tag{6.57} $$

Hence, the total energy in the laboratory frame is always greater than that in the center
of mass frame. In fact, it can be demonstrated that the total energy in the center of mass
frame is less than the total energy in any other inertial frame. Second,

$$ E_1 = E_1' = \left(\frac{m_2}{m_1 + m_2}\right) E, \tag{6.58} $$

$$ E_2 = E_2' = \left(\frac{m_1}{m_1 + m_2}\right) E. \tag{6.59} $$

These equations specify how the total energy in the center of mass frame is distributed
between the two particles. Note that this distribution is unchanged by the collision. Finally,

$$ \mathcal{E}_1' = \left[\frac{m_1^2 + 2\,m_1 m_2\cos\theta + m_2^2}{(m_1 + m_2)^2}\right] \mathcal{E}, \tag{6.60} $$

$$ \mathcal{E}_2' = \left[\frac{2\,m_1 m_2\,(1 - \cos\theta)}{(m_1 + m_2)^2}\right] \mathcal{E}. \tag{6.61} $$

These equations specify how the total energy in the laboratory frame is distributed between
the two particles after the collision. Note that the energy distribution in the laboratory
frame is different before and after the collision.
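
The energy relations above follow directly from the velocity components in
Equations (6.54)-(6.56), and this can be confirmed numerically. The Python sketch below is
an illustration rather than part of the original text; the function name `lab_energies`
and the default value of $v_1$ are assumptions.

```python
import math


def lab_energies(theta, m1, m2, v1=1.0):
    """Laboratory-frame kinetic energies for center of mass scattering angle theta.

    Returns (total before, particle 1 after, particle 2 after).
    """
    c = m1 / m2
    V1 = ((1.0 + c) * v1, 0.0)                                          # Equation (6.54)
    V1p = (v1 * (math.cos(theta) + c), v1 * math.sin(theta))            # Equation (6.55)
    V2p = (c * v1 * (1.0 - math.cos(theta)), -c * v1 * math.sin(theta)) # Equation (6.56)

    def kinetic(m, V):
        return 0.5 * m * (V[0]**2 + V[1]**2)

    return kinetic(m1, V1), kinetic(m1, V1p), kinetic(m2, V2p)


if __name__ == "__main__":
    m1, m2, theta = 1.0, 3.0, 1.1
    total, e1, e2 = lab_energies(theta, m1, m2)
    frac1 = (m1**2 + 2.0 * m1 * m2 * math.cos(theta) + m2**2) / (m1 + m2)**2  # (6.60)
    frac2 = 2.0 * m1 * m2 * (1.0 - math.cos(theta)) / (m1 + m2)**2            # (6.61)
    print(e1 / total, frac1)
    print(e2 / total, frac2)
```

The two energy fractions sum to one for every scattering angle, as required by energy
conservation in an elastic collision.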

Equations (6.50)-(6.56), and some simple trigonometry, yield

$$ \tan\psi = \frac{\sin\theta}{\cos\theta + m_1/m_2}, \tag{6.62} $$

and

$$ \tan\zeta = \frac{\sin\theta}{1 - \cos\theta} = \tan\left(\frac{\pi}{2} - \frac{\theta}{2}\right). \tag{6.63} $$

The last equation implies that

$$ \zeta = \frac{\pi}{2} - \frac{\theta}{2}. \tag{6.64} $$

Differentiating Equation (6.62) with respect to $\theta$, we obtain

$$ \frac{d\tan\psi}{d\theta} = \frac{1 + (m_1/m_2)\cos\theta}{(\cos\theta + m_1/m_2)^2}. \tag{6.65} $$

Thus, $\tan\psi$ attains an extreme value, which can be shown to correspond to a maximum
possible value of $\psi$, when the numerator of the above expression is zero: i.e., when

$$ \cos\theta = -\frac{m_2}{m_1}. \tag{6.66} $$

Note that it is only possible to solve the above equation when $m_1 > m_2$. If this is the case
then Equation (6.62) yields

$$ \tan\psi_{max} = \frac{m_2/m_1}{\sqrt{1 - (m_2/m_1)^2}}, \tag{6.67} $$

which reduces to

$$ \psi_{max} = \sin^{-1}\left(\frac{m_2}{m_1}\right). \tag{6.68} $$

Hence, we conclude that when $m_1 > m_2$ there is a maximum possible value of the
scattering angle, $\psi$, in the laboratory frame. This maximum value is always less than $\pi/2$,
which implies that there is no backward scattering (i.e., $\psi > \pi/2$) at all when $m_1 > m_2$.
For the special case when $m_1 = m_2$, the maximum scattering angle is $\pi/2$. However, for
$m_1 < m_2$ there is no maximum value, and the scattering angle in the laboratory frame can
thus range all the way to $\pi$.
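
The existence of a maximum laboratory scattering angle can be illustrated by scanning
Equation (6.62) over all center of mass scattering angles. The Python sketch below is an
example that is not from the original text; the function names and the number of scan points
are assumptions. It uses `math.atan2` so that $\psi$ is returned in the correct quadrant
when $\cos\theta + m_1/m_2$ is negative.

```python
import math


def lab_angle(theta, m1, m2):
    """Laboratory scattering angle psi of the first particle, Equation (6.62)."""
    return math.atan2(math.sin(theta), math.cos(theta) + m1 / m2)


def max_lab_angle(m1, m2, n=100000):
    """Largest psi found on a uniform grid of theta values between 0 and pi."""
    return max(lab_angle(math.pi * i / n, m1, m2) for i in range(n + 1))


if __name__ == "__main__":
    print(max_lab_angle(2.0, 1.0), math.asin(0.5))   # compare with Equation (6.68)
    print(max_lab_angle(1.0, 2.0), math.pi)          # light projectile: no upper bound below pi
```

For a projectile twice as heavy as the target the scan reproduces $\sin^{-1}(1/2) = \pi/6$,
whereas for a projectile lighter than the target the angle approaches $\pi$.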

Equations (6.57)-(6.64) enable us to relate the particle energies and scattering angles 
in the laboratory frame to those in the center of mass frame. In general, these relationships 
are fairly complicated. However, there are two special cases in which the relationships 
become much simpler. 

The first special case is when $m_2 \gg m_1$. In this limit, it is easily seen from
Equations (6.57)-(6.64) that the second mass is stationary both before and after the collision,
and that the center of mass frame coincides with the laboratory frame (since the energies
and scattering angles in the two frames are the same). Hence, the simple analysis outlined
in Section 6.4 is applicable in this case.

The second special case is when $m_1 = m_2$. In this case, Equation (6.62) yields

$$ \tan\psi = \frac{\sin\theta}{\cos\theta + 1} = \tan(\theta/2). \tag{6.69} $$

Hence,

$$ \psi = \frac{\theta}{2}. \tag{6.70} $$

In other words, the scattering angle of the first particle in the laboratory frame is half of
the scattering angle in the center of mass frame. The above equation can be combined
with Equation (6.64) to give

$$ \psi + \zeta = \frac{\pi}{2}. \tag{6.71} $$

Thus, in the laboratory frame, the two particles move off at right-angles to one another
after the collision. Equation (6.57) yields

$$ \mathcal{E} = 2\,E. \tag{6.72} $$

In other words, the total energy in the laboratory frame is twice that in the center of mass
frame. According to Equations (6.58) and (6.59),

$$ E_1 = E_1' = E_2 = E_2' = \frac{E}{2}. \tag{6.73} $$

Hence, the total energy in the center of mass frame is divided equally between the two
particles. Finally, Equations (6.60) and (6.61) give

$$ \mathcal{E}_1' = \left(\frac{1 + \cos\theta}{2}\right)\mathcal{E} = \cos^2(\theta/2)\,\mathcal{E} = \cos^2\psi\;\mathcal{E}, \tag{6.74} $$

$$ \mathcal{E}_2' = \left(\frac{1 - \cos\theta}{2}\right)\mathcal{E} = \sin^2(\theta/2)\,\mathcal{E} = \sin^2\psi\;\mathcal{E}. \tag{6.75} $$

Thus, in the laboratory frame, the unequal energy distribution between the two particles
after the collision is simply related to the scattering angle $\psi$.
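
The equal-mass results can be spot-checked with a short Python sketch. It is illustrative
only and not part of the original text; the function name `equal_mass_outcome` is an
assumption. The angles are obtained from Equations (6.62) and (6.63) with $m_1 = m_2$, and
the energy fractions from Equations (6.74) and (6.75).

```python
import math


def equal_mass_outcome(theta):
    """Laboratory angles and energy fractions for m1 = m2.

    Returns (psi, zeta, fraction of lab energy kept by particle 1,
    fraction transferred to particle 2).
    """
    psi = math.atan2(math.sin(theta), math.cos(theta) + 1.0)    # Equation (6.62)
    zeta = math.atan2(math.sin(theta), 1.0 - math.cos(theta))   # Equation (6.63)
    frac1 = (1.0 + math.cos(theta)) / 2.0                       # Equation (6.74)
    frac2 = (1.0 - math.cos(theta)) / 2.0                       # Equation (6.75)
    return psi, zeta, frac1, frac2


if __name__ == "__main__":
    for theta in (0.3, 1.2, 2.5):
        psi, zeta, frac1, frac2 = equal_mass_outcome(theta)
        print(theta, psi, zeta, psi + zeta, frac1, math.cos(psi)**2)
```

For every scattering angle the two laboratory angles add up to a right angle, and the
fraction of energy retained by the incident particle equals $\cos^2\psi$.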

What is the angular distribution of scattered particles when a beam of particles of
the first type scatter off stationary particles of the second type? Well, we can define a
differential scattering cross-section, $d\sigma(\psi)/d\Omega'$, in the laboratory frame, where
$d\Omega' = 2\pi\,\sin\psi\,d\psi$ is an element of solid angle in this frame. Thus, $(d\sigma(\psi)/d\Omega')\,d\Omega'$ is the
effective cross-sectional area in the laboratory frame for scattering into the range of scattering
angles $\psi$ to $\psi + d\psi$. Likewise, $(d\sigma(\theta)/d\Omega)\,d\Omega$ is the effective cross-sectional area in the
center of mass frame for scattering into the range of scattering angles $\theta$ to $\theta + d\theta$. Note
that $d\Omega = 2\pi\,\sin\theta\,d\theta$. However, a cross-sectional area is not changed when we transform
between different inertial frames. Hence, we can write

$$ \frac{d\sigma(\psi)}{d\Omega'}\,d\Omega' = \frac{d\sigma(\theta)}{d\Omega}\,d\Omega, \tag{6.76} $$

provided that $\psi$ and $\theta$ are related via Equation (6.62). This equation can be rearranged to
give

$$ \frac{d\sigma(\psi)}{d\Omega'} = \frac{d\Omega}{d\Omega'}\,\frac{d\sigma(\theta)}{d\Omega}, \tag{6.77} $$

or

$$ \frac{d\sigma(\psi)}{d\Omega'} = \frac{\sin\theta}{\sin\psi}\,\frac{d\theta}{d\psi}\,\frac{d\sigma(\theta)}{d\Omega}. \tag{6.78} $$



The above equation allows us to relate the differential scattering cross-section in the
laboratory frame to that in the center of mass frame. In general, this relationship is extremely
complicated. However, for the special case where the masses of the two types of
particles are equal, we have seen that $\psi = \theta/2$ [see Equation (6.70)]. Hence, it follows from
Equation (6.78) that

$$ \frac{d\sigma(\psi)}{d\Omega'} = 4\,\cos\psi\;\frac{d\sigma(\theta = 2\psi)}{d\Omega}. \tag{6.79} $$

Let us now consider some specific examples. We saw earlier that, in the center of
mass frame, the differential scattering cross-section for impenetrable spheres is [see
Equation (6.35)]

$$ \frac{d\sigma}{d\Omega} = \frac{a^2}{4}, \tag{6.80} $$

where $a$ is the sum of the radii. According to Equation (6.79), the differential scattering
cross-section (for equal mass spheres) in the laboratory frame is

$$ \frac{d\sigma}{d\Omega'} = a^2\,\cos\psi. \tag{6.81} $$

Note that this cross-section is negative for $\psi > \pi/2$. This just tells us that there is no
scattering with scattering angles greater than $\pi/2$ (i.e., there is no backward scattering).
Comparing Equations (6.80) and (6.81), we can see that the scattering is isotropic in the
center of mass frame, but appears concentrated in the forward direction in the laboratory
frame. We can integrate Equation (6.81) over all solid angles to obtain the total scattering
cross-section in the laboratory frame. Note that we only integrate over angular regions
where the differential scattering cross-section is positive. Doing this, we get

$$ \sigma = \pi\,a^2, \tag{6.82} $$

which is the same as the total scattering cross-section in the center of mass frame [see
Equation (6.36)]. This is a general result. The total scattering cross-section is frame
independent, since a cross-sectional area is not modified by switching between different frames
of reference.
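
The frame independence of the total cross-section can be confirmed for equal-mass hard
spheres by integrating Equation (6.81) over the forward hemisphere only. The Python sketch
below is illustrative and not part of the original text; the function name and the number of
quadrature points are assumptions.

```python
import math


def lab_total_cross_section(a, n=2000):
    """Integrate a^2 cos(psi) over solid angle for 0 < psi < pi/2,
    where the laboratory differential cross-section (6.81) is positive."""
    dpsi = (math.pi / 2.0) / n
    total = 0.0
    for i in range(n):
        psi = (i + 0.5) * dpsi
        dsigma = a**2 * math.cos(psi)                          # Equation (6.81)
        total += dsigma * 2.0 * math.pi * math.sin(psi) * dpsi
    return total


if __name__ == "__main__":
    a = 1.5
    print(lab_total_cross_section(a), math.pi * a**2)   # compare with Equation (6.82)
```

The result matches $\pi a^2$, the same value obtained in the center of mass frame.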

As we have seen, the Rutherford scattering cross-section takes the form [see
Equation (6.46)]

$$ \frac{d\sigma}{d\Omega} = \frac{q^4}{16\,(4\pi\,\epsilon_0)^2\,E^2}\,\frac{1}{\sin^4(\theta/2)} \tag{6.83} $$

in the center of mass frame. It follows, from Equation (6.79), that the Rutherford scattering
cross-section (for equal mass particles) in the laboratory frame is written

$$ \frac{d\sigma}{d\Omega'} = \frac{q^4}{(4\pi\,\epsilon_0)^2\,\mathcal{E}^2}\,\frac{\cos\psi}{\sin^4\psi}. \tag{6.84} $$

Here, we have made use of the fact that $\mathcal{E} = 2\,E$ for equal mass particles [see
Equation (6.72)]. Note, again, that this cross-section is negative for $\psi > \pi/2$, indicating the
absence of backward scattering.



6.6 Exercises 

6.1. A particle subject to a repulsive force varying as $1/r^3$ is projected from infinity with a velocity
that would carry it to a distance $a$ from the center of force, if it were directed toward the
latter. Actually, it is projected along a line whose closest distance from the center of force
would be $b$ if there were no repulsion. Prove that the particle's least distance from the center
is $\sqrt{a^2 + b^2}$, and that the angle between the two asymptotes of its path is $\pi\,b/\sqrt{a^2 + b^2}$.

6.2. A particle subject to a repulsive force varying as $1/r^5$ is projected from infinity with a velocity
$V$ that would carry it to a distance $a$ from the center of force, if it were directed toward the
latter. Actually, it is projected along a line whose closest distance from the center of force
would be $b$ if there were no repulsion. Show that the least velocity of the particle is

$$ V\,\frac{b}{\sqrt{2}\,a^2}\left[\sqrt{4\,a^4 + b^4} - b^2\right]^{1/2}. $$

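6.3. Using the notation of Section 6.2, show that the angular momentum of a two-body system
takes the form

$$ \mathbf{L} = M\,\mathbf{r}_{cm} \times \dot{\mathbf{r}}_{cm} + \mu\,\mathbf{r} \times \dot{\mathbf{r}}, $$

where $M = m_1 + m_2$.

6.4. Consider the case of Rutherford scattering in the event that $m_1 \gg m_2$. Demonstrate that the
differential scattering cross-section in the laboratory frame is approximately

$$ \frac{d\sigma}{d\Omega'} \simeq \frac{q^4}{4\,(4\pi\,\epsilon_0)^2\,\mathcal{E}^2}\,\frac{\psi_{max}^{-4}}{\left[1 - \sqrt{1 - (\psi/\psi_{max})^2}\right]^2\left[1 - (\psi/\psi_{max})^2\right]^{1/2}}, $$

where $\psi_{max} = m_2/m_1$.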

6.5. Show that the energy distribution of particles recoiling from an elastic collision is always 
directly proportional to the differential scattering cross-section in the center of mass frame. 

6.6. It is found experimentally that in the elastic scattering of neutrons by protons ($m_n \simeq m_p$)
at relatively low energies the energy distribution of the recoiling protons in the laboratory
frame is constant up to a maximum energy, which is the energy of the incident neutrons.
What is the angular distribution of the scattering in the center of mass frame?

6.7. The most energetic $\alpha$-particles available to Ernest Rutherford and his colleagues for the
famous Rutherford scattering experiment were 7.7 MeV. For the scattering of 7.7 MeV
$\alpha$-particles from Uranium 238 nuclei (initially at rest) at a scattering angle in the laboratory
frame of 90°, find the following (in the laboratory frame, unless otherwise specified):

(a) The recoil scattering angle of the Uranium nucleus.

(b) The scattering angles of the $\alpha$-particle and Uranium nucleus in the center of mass frame.

(c) The kinetic energies of the scattered $\alpha$-particle and Uranium nucleus (in MeV).

(d) The impact parameter, $b$.

(e) The distance of closest approach.

(f) The differential scattering cross-section at 90°.

6.8. Consider scattering by the repulsive potential $U = k/r^2$ (where $k > 0$) viewed in the center
of mass frame. Demonstrate that the differential scattering cross-section is

$$ \frac{d\sigma}{d\Omega} = \frac{k}{E\,\pi}\,\frac{1}{\sin\theta}\,\frac{(1 - \theta/\pi)}{(\theta/\pi)^2\,(2 - \theta/\pi)^2}. $$

7 Rotating Reference Frames



7.1 Introduction

As we saw in Chapter 2, Newton's second law of motion is only valid in inertial frames 
of reference. Unfortunately, we are sometimes forced to observe motion in non-inertial 
reference frames. For instance, it is most convenient for us to observe the motions of the 
objects in our immediate vicinity in a reference frame which is fixed relative to the surface 
of the Earth. Such a frame is non-inertial in nature, since it accelerates with respect to 
a standard inertial frame due to the Earth's daily rotation about its axis. (Note that the 
accelerations of this frame due to the Earth's orbital motion about the Sun, or the Sun's 
orbital motion about the galactic center, etc., are negligible compared to the acceleration 
due to the Earth's axial rotation.) Let us now investigate motion in a rotating reference 
frame. 



7.2 Rotating Reference Frames 

Suppose that a given object has position vector $\mathbf{r}$ in some non-rotating inertial reference
frame. Let us observe the motion of this object in a non-inertial reference frame which
rotates with constant angular velocity $\boldsymbol{\Omega}$ about an axis passing through the origin of the
inertial frame. Suppose, first of all, that our object appears stationary in the rotating
reference frame. Hence, in the non-rotating frame, the object's position vector $\mathbf{r}$ will appear
to precess about the origin with angular velocity $\boldsymbol{\Omega}$. It follows, from Equation (A.49), that
in the non-rotating reference frame

$$ \frac{d\mathbf{r}}{dt} = \boldsymbol{\Omega} \times \mathbf{r}. \tag{7.1} $$

Suppose, now, that our object appears to move in the rotating reference frame with
instantaneous velocity $\mathbf{v}'$. It is fairly obvious that the appropriate generalization of the above
equation is simply

$$ \frac{d\mathbf{r}}{dt} = \mathbf{v}' + \boldsymbol{\Omega} \times \mathbf{r}. \tag{7.2} $$

Let $d/dt$ and $d/dt'$ denote apparent time derivatives in the non-rotating and
rotating frames of reference, respectively. Since an object which is stationary in the rotating
reference frame appears to move in the non-rotating frame, it is clear that $d/dt \neq d/dt'$.
Writing the apparent velocity, $\mathbf{v}'$, of our object in the rotating reference frame as $d\mathbf{r}/dt'$,
the above equation takes the form

$$ \frac{d\mathbf{r}}{dt} = \frac{d\mathbf{r}}{dt'} + \boldsymbol{\Omega} \times \mathbf{r}, \tag{7.3} $$

or

$$ \frac{d}{dt} = \frac{d}{dt'} + \boldsymbol{\Omega} \times, \tag{7.4} $$

since $\mathbf{r}$ is a general position vector. Equation (7.4) expresses the relationship between
apparent time derivatives in the non-rotating and rotating reference frames.

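Operating on the general position vector $\mathbf{r}$ with the time derivative (7.4), we get

$$ \mathbf{v} = \mathbf{v}' + \boldsymbol{\Omega} \times \mathbf{r}. \tag{7.5} $$

This equation relates the apparent velocity, $\mathbf{v} = d\mathbf{r}/dt$, of an object with position vector
$\mathbf{r}$ in the non-rotating reference frame to its apparent velocity, $\mathbf{v}' = d\mathbf{r}/dt'$, in the rotating
reference frame.

Operating twice on the position vector $\mathbf{r}$ with the time derivative (7.4), we obtain

$$ \mathbf{a} = \left(\frac{d}{dt'} + \boldsymbol{\Omega} \times\right)\left(\mathbf{v}' + \boldsymbol{\Omega} \times \mathbf{r}\right), \tag{7.6} $$

or

$$ \mathbf{a} = \mathbf{a}' + \boldsymbol{\Omega} \times (\boldsymbol{\Omega} \times \mathbf{r}) + 2\,\boldsymbol{\Omega} \times \mathbf{v}'. \tag{7.7} $$

This equation relates the apparent acceleration, $\mathbf{a} = d^2\mathbf{r}/dt^2$, of an object with position
vector $\mathbf{r}$ in the non-rotating reference frame to its apparent acceleration, $\mathbf{a}' = d^2\mathbf{r}/dt'^2$, in
the rotating reference frame.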

Applying Newton's second law of motion in the inertial (i.e., non-rotating) reference
frame, we obtain

$$ m\,\mathbf{a} = \mathbf{f}. \tag{7.8} $$

Here, $m$ is the mass of our object, and $\mathbf{f}$ is the (non-fictitious) force acting on it. Note that
these quantities are the same in both reference frames. Making use of Equation (7.7), the
apparent equation of motion of our object in the rotating reference frame takes the form

$$ m\,\mathbf{a}' = \mathbf{f} - m\,\boldsymbol{\Omega} \times (\boldsymbol{\Omega} \times \mathbf{r}) - 2\,m\,\boldsymbol{\Omega} \times \mathbf{v}'. \tag{7.9} $$

The last two terms in the above equation are so-called "fictitious forces". Such forces
are always needed to account for motion observed in non-inertial reference frames. Note
that fictitious forces can always be distinguished from non-fictitious forces in Newtonian
dynamics because the former have no associated reactions. Let us now investigate the two
fictitious forces appearing in Equation (7.9).
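
The two fictitious forces in Equation (7.9) are straightforward to evaluate for given
vectors. The Python sketch below is an illustration that is not part of the original text;
the helper names `cross` and `fictitious_forces` are assumptions, and vectors are represented
as plain 3-tuples.

```python
def cross(a, b):
    """Cross product of two 3-component vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def fictitious_forces(m, omega, r, v_rot):
    """Return the two fictitious force terms of Equation (7.9).

    omega is the angular velocity of the rotating frame, r the position vector,
    and v_rot the apparent velocity v' in the rotating frame.
    """
    first = tuple(-m * c for c in cross(omega, cross(omega, r)))   # -m Omega x (Omega x r)
    second = tuple(-2.0 * m * c for c in cross(omega, v_rot))      # -2 m Omega x v'
    return first, second


if __name__ == "__main__":
    # Frame rotating about the z-axis; object on the x-axis moving along x.
    print(fictitious_forces(1.0, (0.0, 0.0, 1.0), (1.0, 0.0, 0.0), (1.0, 0.0, 0.0)))
```

For rotation about the z-axis, the first term points radially away from the rotation axis,
while the second is perpendicular to both the rotation axis and the apparent velocity.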



7.3 Centrifugal Acceleration 

Let our non-rotating inertial frame be one whose origin lies at the center of the Earth, and
let our rotating frame be one whose origin is fixed with respect to some point, of latitude
$\lambda$, on the Earth's surface (see Figure 7.1). The latter reference frame thus rotates with
respect to the former (about an axis passing through the Earth's center) with an angular
velocity vector, $\boldsymbol{\Omega}$, which points from the center of the Earth toward its north pole, and is
of magnitude

$$\Omega = \frac{2\pi}{24\ \mathrm{hrs}} = 7.27\times 10^{-5}\ \mathrm{rad/s}. \tag{7.10}$$

Figure 7.1: Inertial and non-inertial reference frames.

Consider an object which appears stationary in our rotating reference frame: i.e., an
object which is stationary with respect to the Earth's surface. According to Equation (7.9),
the object's apparent equation of motion in the rotating frame takes the form

$$m\,\mathbf{a}' = \mathbf{f} - m\,\boldsymbol{\Omega}\times(\boldsymbol{\Omega}\times\mathbf{r}). \tag{7.11}$$

Let the non-fictitious force acting on our object be the force of gravity, $\mathbf{f} = m\,\mathbf{g}$. Here,
the local gravitational acceleration, $\mathbf{g}$, points directly toward the center of the Earth. It
follows, from the above, that the apparent gravitational acceleration in the rotating frame
is written

$$\mathbf{g}' = \mathbf{g} - \boldsymbol{\Omega}\times(\boldsymbol{\Omega}\times\mathbf{R}), \tag{7.12}$$

where $\mathbf{R}$ is the displacement vector of the origin of the rotating frame (which lies on the
Earth's surface) with respect to the center of the Earth. Here, we are assuming that our
object is situated relatively close to the Earth's surface (i.e., $\mathbf{r} \simeq \mathbf{R}$).

It can be seen, from Equation (7.12), that the apparent gravitational acceleration of a
stationary object close to the Earth's surface has two components. The first is the true
gravitational acceleration, $\mathbf{g}$, of magnitude $g \simeq 9.8\ \mathrm{m/s^2}$, which always points directly toward the
center of the Earth. The second is the so-called centrifugal acceleration, $-\boldsymbol{\Omega}\times(\boldsymbol{\Omega}\times\mathbf{R})$. This
acceleration is normal to the Earth's axis of rotation, and always points directly away from
this axis. The magnitude of the centrifugal acceleration is $\Omega^2\rho = \Omega^2 R\cos\lambda$, where $\rho$ is
the perpendicular distance to the Earth's rotation axis, and $R = 6.37\times 10^{6}\ \mathrm{m}$ is the Earth's
radius (see Figure 7.2).

Figure 7.2: Centrifugal acceleration.

It is convenient to define Cartesian axes in the rotating reference frame such that the
$z'$-axis points vertically upward, and the $x'$- and $y'$-axes are horizontal, with the $x'$-axis
pointing directly northward, and the $y'$-axis pointing directly westward (see Figure 7.1). The
Cartesian components of the Earth's angular velocity are thus

$$\boldsymbol{\Omega} = \Omega\,(\cos\lambda,\ 0,\ \sin\lambda), \tag{7.13}$$

whilst the vectors $\mathbf{R}$ and $\mathbf{g}$ are written

$$\mathbf{R} = (0,\ 0,\ R), \tag{7.14}$$
$$\mathbf{g} = (0,\ 0,\ -g), \tag{7.15}$$

respectively. It follows that the Cartesian coordinates of the apparent gravitational
acceleration, (7.12), are

$$\mathbf{g}' = \left(-\Omega^2 R\cos\lambda\,\sin\lambda,\ 0,\ -g + \Omega^2 R\cos^2\lambda\right). \tag{7.16}$$

The magnitude of this acceleration is approximately

$$g' \simeq g - \Omega^2 R\cos^2\lambda \simeq 9.8 - 0.034\cos^2\lambda\ \ \mathrm{m/s^2}. \tag{7.17}$$



According to the above equation, the centrifugal acceleration causes the magnitude of the 
apparent gravitational acceleration on the Earth's surface to vary by about 0.3%, being 
largest at the poles, and smallest at the equator. This variation in apparent gravitational 
acceleration, due (ultimately) to the Earth's rotation, causes the Earth itself to bulge slightly 
at the equator (see Section 12.6), which has the effect of further intensifying the variation, 
since a point on the surface of the Earth at the equator is slightly further away from the 
Earth's center than a similar point at one of the poles (and, hence, the true gravitational 
acceleration is slightly weaker in the former case).

Another consequence of centrifugal acceleration is that the apparent gravitational
acceleration on the Earth's surface has a horizontal component aligned in the north/south
direction. This horizontal component ensures that the apparent gravitational acceleration
does not point directly toward the center of the Earth. In other words, a plumb-line on
the surface of the Earth does not point vertically downward, but is deflected slightly away
from a true vertical in the north/south direction. The angular deviation from true vertical
can easily be calculated from Equation (7.16):

$$\theta_{dev} \simeq -\frac{\Omega^2 R}{2\,g}\,\sin(2\lambda) \simeq -0.1^\circ\,\sin(2\lambda). \tag{7.18}$$

Here, a positive angle denotes a northward deflection, and vice versa. Thus, the deflection
is southward in the northern hemisphere (i.e., $\lambda > 0$) and northward in the southern
hemisphere (i.e., $\lambda < 0$). The deflection is zero at the poles and at the equator, and reaches its
maximum magnitude (which is very small) at middle latitudes.
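
The estimates in Equations (7.16)-(7.18) can be checked numerically. The sketch below assumes the values of Ω, R, and g quoted in the text; the function names are hypothetical, and the components are ordered (north, west, up) as in the axes defined above.

```python
import math

OMEGA = 7.27e-5      # Earth's angular speed in rad/s, Equation (7.10)
R_EARTH = 6.37e6     # Earth's radius in m
G_TRUE = 9.8         # true gravitational acceleration in m/s^2


def apparent_gravity(lat_deg):
    """Components (north, west, up) of g' from Equation (7.16)."""
    lam = math.radians(lat_deg)
    c = OMEGA ** 2 * R_EARTH
    g_north = -c * math.cos(lam) * math.sin(lam)
    g_up = -G_TRUE + c * math.cos(lam) ** 2
    return (g_north, 0.0, g_up)


def magnitude(v):
    """Euclidean length of a vector given as a tuple."""
    return math.sqrt(sum(c * c for c in v))


def plumb_deviation_deg(lat_deg):
    """Plumb-line deviation from true vertical in degrees (positive = northward)."""
    g_north, _, g_up = apparent_gravity(lat_deg)
    return math.degrees(math.atan2(g_north, -g_up))


for lat in (0.0, 45.0, 90.0):
    print(f"lat {lat:5.1f} deg: |g'| = {magnitude(apparent_gravity(lat)):.4f} m/s^2, "
          f"deviation = {plumb_deviation_deg(lat):+.4f} deg")
```

The deviation is computed from the exact components of (7.16) rather than from the small-angle formula (7.18), so the two agree only to leading order in Ω².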



7.4 Coriolis Force 

We have now accounted for the first fictitious force, $-m\,\boldsymbol{\Omega}\times(\boldsymbol{\Omega}\times\mathbf{r})$, in Equation (7.9).
Let us now investigate the second, which takes the form $-2\,m\,\boldsymbol{\Omega}\times\mathbf{v}'$, and is called the
Coriolis force. Obviously, this force only affects objects which are moving in the rotating
reference frame.

Consider a particle of mass $m$ free-falling under gravity in our rotating reference frame.
As before, we define Cartesian axes in the rotating frame such that the $z'$-axis points
vertically upward, and the $x'$- and $y'$-axes are horizontal, with the $x'$-axis pointing directly
northward, and the $y'$-axis pointing directly westward. It follows, from Equation (7.9),
that the Cartesian equations of motion of the particle in the rotating reference frame take
the form:

$$\ddot{x}' = 2\,\Omega\sin\lambda\,\dot{y}', \tag{7.19}$$
$$\ddot{y}' = -2\,\Omega\sin\lambda\,\dot{x}' + 2\,\Omega\cos\lambda\,\dot{z}', \tag{7.20}$$
$$\ddot{z}' = -g - 2\,\Omega\cos\lambda\,\dot{y}'. \tag{7.21}$$

Here, an overdot denotes $d/dt$, and $g$ is the local acceleration due to gravity. In the above, we have
neglected the centrifugal acceleration, for the sake of simplicity. This is reasonable, since
the only effect of the centrifugal acceleration is to slightly modify the magnitude and
direction of the local gravitational acceleration. We have also neglected air resistance,
which is less reasonable.

Consider a particle which is dropped (at $t = 0$) from rest a height $h$ above the Earth's
surface. The following solution method exploits the fact that the Coriolis force is much
smaller in magnitude than the force of gravity: hence, $\Omega$ can be treated as a small
parameter. To lowest order (i.e., neglecting $\Omega$), the particle's vertical motion satisfies $\ddot{z}' = -g$,
which can be solved, subject to the initial conditions, to give

$$z' = h - \frac{g\,t^2}{2}. \tag{7.22}$$

Substituting this expression into Equations (7.19) and (7.20), neglecting terms involving
$\Omega^2$, and solving subject to the initial conditions, we obtain $x' \simeq 0$, and

$$y' \simeq -g\,\Omega\cos\lambda\,\frac{t^3}{3}. \tag{7.23}$$

In other words, the particle is deflected eastward (i.e., in the negative $y'$-direction). Now,
the particle hits the ground when $t \simeq \sqrt{2\,h/g}$. Hence, the net eastward deflection of the
particle as it strikes the ground is

$$d_{east} = \frac{\Omega}{3}\,\cos\lambda\left(\frac{8\,h^3}{g}\right)^{1/2}. \tag{7.24}$$

Note that this deflection is in the same direction as the Earth's rotation (i.e., west to east), 
and is greatest at the equator, and zero at the poles. A particle dropped from a height of 
100 m at the equator is deflected by about 2.2 cm. 
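
The quoted figure can be reproduced with a short Python sketch that evaluates Equation (7.24) and, as an independent check, integrates Equations (7.19)-(7.21) directly. The explicit Euler scheme, the step size, and the function names are assumptions made for illustration only; Ω and g take the values used in the text.

```python
import math

OMEGA = 7.27e-5   # Earth's angular speed in rad/s, Equation (7.10)
G = 9.8           # gravitational acceleration in m/s^2


def deflection_closed_form(h, lat_deg):
    """Eastward deflection from Equation (7.24), in metres."""
    lam = math.radians(lat_deg)
    return (OMEGA / 3.0) * math.cos(lam) * math.sqrt(8.0 * h ** 3 / G)


def deflection_numerical(h, lat_deg, dt=1.0e-4):
    """Eastward deflection from explicit Euler integration of (7.19)-(7.21)."""
    lam = math.radians(lat_deg)
    s, c = math.sin(lam), math.cos(lam)
    x, y, z = 0.0, 0.0, h            # x' north, y' west, z' up
    vx, vy, vz = 0.0, 0.0, 0.0       # dropped from rest
    while z > 0.0:
        ax = 2.0 * OMEGA * s * vy
        ay = -2.0 * OMEGA * s * vx + 2.0 * OMEGA * c * vz
        az = -G - 2.0 * OMEGA * c * vy
        x, y, z = x + vx * dt, y + vy * dt, z + vz * dt
        vx, vy, vz = vx + ax * dt, vy + ay * dt, vz + az * dt
    return -y                        # east is the negative y' direction


d_formula = deflection_closed_form(100.0, 0.0)
d_numeric = deflection_numerical(100.0, 0.0)
print(f"closed form: {100 * d_formula:.2f} cm, numerical: {100 * d_numeric:.2f} cm")
```

Both estimates give roughly 2.2 cm for a 100 m drop at the equator, which suggests that the terms of order Ω² dropped in the derivation are indeed negligible here.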

Consider a particle launched horizontally with some fairly large velocity

$$\mathbf{V} = V\,(\cos\theta,\ -\sin\theta,\ 0). \tag{7.25}$$

Here, $\theta$ is the compass bearing of the velocity vector (so north is 0°, east is 90°, etc.).
Neglecting any vertical motion, Equations (7.19) and (7.20) yield

$$\dot{v}_{x'} \simeq -2\,\Omega\,V\sin\lambda\,\sin\theta, \tag{7.26}$$
$$\dot{v}_{y'} \simeq -2\,\Omega\,V\sin\lambda\,\cos\theta, \tag{7.27}$$

which can be integrated to give

$$v_{x'} \simeq V\cos\theta - 2\,\Omega\,V\sin\lambda\,\sin\theta\ t, \tag{7.28}$$
$$v_{y'} \simeq -V\sin\theta - 2\,\Omega\,V\sin\lambda\,\cos\theta\ t. \tag{7.29}$$

To lowest order in $\Omega$, the above equations are equivalent to

$$v_{x'} \simeq V\cos(\theta + 2\,\Omega\sin\lambda\ t), \tag{7.30}$$
$$v_{y'} \simeq -V\sin(\theta + 2\,\Omega\sin\lambda\ t). \tag{7.31}$$

It follows that the Coriolis force causes the compass bearing of the particle's velocity vector
to rotate steadily as time progresses. The rotation rate is

$$\frac{d\theta}{dt} \simeq 2\,\Omega\sin\lambda. \tag{7.32}$$

Hence, the rotation is clockwise (looking from above) in the northern hemisphere, and
counter-clockwise in the southern hemisphere. The rotation rate is zero at the equator, and
greatest at the poles.

Figure 7.3: A cyclone in the northern hemisphere.

The Coriolis force has a significant effect on terrestrial weather patterns. Near equa- 
torial regions, the intense heating of the Earth's surface due to the Sun results in hot air 
rising. In the northern hemisphere, this causes cooler air to move in a southerly direc- 
tion toward the equator. The Coriolis force deflects this moving air in a clockwise sense 
(looking from above), resulting in the trade winds, which blow toward the southwest. In 
the southern hemisphere, the cooler air moves northward, and is deflected by the Cori- 
olis force in a counter-clockwise sense, resulting in trade winds which blow toward the 
northwest. 

Furthermore, as air flows from high to low pressure regions, the Coriolis force deflects 
the air in a clockwise/counter-clockwise manner in the northern/southern hemisphere, 
producing cyclonic rotation — see Figure 7.3. It follows that cyclonic rotation is counter- 
clockwise in the northern hemisphere, and clockwise in the southern hemisphere. Thus, this is 
the direction of rotation of tropical storms (e.g., hurricanes, typhoons) in each hemisphere. 



7.5 Foucault Pendulum 



Consider a pendulum consisting of a compact mass $m$ suspended from a light cable of
length $l$ in such a manner that the pendulum is free to oscillate in any plane whose normal
is parallel to the Earth's surface. The mass is subject to three forces: first, the force of
gravity $m\,\mathbf{g}$, which is directed vertically downward (we are again ignoring centrifugal
acceleration); second, the tension $\mathbf{T}$ in the cable, which is directed upward along the
cable; and, third, the Coriolis force. It follows that the apparent equation of motion of the
mass, in a frame of reference which co-rotates with the Earth, is [see Equation (7.9)]

$$m\,\ddot{\mathbf{r}}' = m\,\mathbf{g} + \mathbf{T} - 2\,m\,\boldsymbol{\Omega}\times\dot{\mathbf{r}}'. \tag{7.33}$$

Let us define our usual Cartesian coordinates $(x', y', z')$, and let the origin of our
coordinate system correspond to the equilibrium position of the mass. If the pendulum cable is
deflected from the downward vertical by a small angle $\theta$ then it is easily seen that $x' \sim l\,\theta$,
$y' \sim l\,\theta$, and $z' \sim l\,\theta^2$. In other words, the change in height of the mass, $z'$, is negligible
compared to its horizontal displacement. Hence, we can write $z' \simeq 0$, provided that
$\theta \ll 1$. The tension $\mathbf{T}$ has the vertical component $T\cos\theta \simeq T$, and the horizontal
component $\mathbf{T}_{hz} = -T\sin\theta\,\mathbf{r}'/r' \simeq -T\,\mathbf{r}'/l$, since $\sin\theta \simeq r'/l$ (see Figure 7.4). Hence, the
Cartesian equations of motion of the mass are written [cf., Equations (7.19)-(7.21)]

$$\ddot{x}' = -\frac{T}{l\,m}\,x' + 2\,\Omega\sin\lambda\,\dot{y}', \tag{7.34}$$
$$\ddot{y}' = -\frac{T}{l\,m}\,y' - 2\,\Omega\sin\lambda\,\dot{x}', \tag{7.35}$$
$$\ddot{z}' = \frac{T}{m} - g - 2\,\Omega\cos\lambda\,\dot{y}'. \tag{7.36}$$

To lowest order in $\Omega$ (i.e., neglecting $\Omega$), the final equation, which is just vertical force
balance, yields $T \simeq m\,g$. Hence, Equations (7.34) and (7.35) reduce to

$$\ddot{x}' \simeq -\frac{g}{l}\,x' + 2\,\Omega\sin\lambda\,\dot{y}', \tag{7.37}$$
$$\ddot{y}' \simeq -\frac{g}{l}\,y' - 2\,\Omega\sin\lambda\,\dot{x}'. \tag{7.38}$$

Let

$$s = x' + i\,y'. \tag{7.39}$$

Equations (7.37) and (7.38) can be combined to give a single complex equation for $s$:

$$\ddot{s} = -\frac{g}{l}\,s - i\,2\,\Omega\sin\lambda\,\dot{s}. \tag{7.40}$$

Let us look for a sinusoidally oscillating solution of the form

$$s = s_0\,e^{-i\,\omega\,t}. \tag{7.41}$$

Here, $\omega$ is the (real) angular frequency of oscillation, and $s_0$ is an arbitrary complex
constant. Equations (7.40) and (7.41) yield the following quadratic equation for $\omega$:

$$\omega^2 - 2\,\Omega\sin\lambda\ \omega - \frac{g}{l} = 0. \tag{7.42}$$

The solutions are approximately



$$\omega_\pm \simeq \Omega\sin\lambda \pm \sqrt{\frac{g}{l}}, \tag{7.43}$$

where we have neglected terms involving $\Omega^2$. Hence, the general solution of (7.40) takes
the form

$$s = s_+\,e^{-i\,\omega_+ t} + s_-\,e^{-i\,\omega_- t}, \tag{7.44}$$

where $s_+$ and $s_-$ are two arbitrary complex constants.

Making the specific choice $s_+ = s_- = a/2$, where $a$ is real, the above solution reduces
to

$$s = a\,e^{-i\,\Omega\sin\lambda\ t}\,\cos\left(\sqrt{\frac{g}{l}}\ t\right). \tag{7.45}$$

Now, it is clear from Equation (7.39) that $x'$ and $y'$ are the real and imaginary parts of $s$,
respectively. Thus, it follows from the above that

$$x' = a\,\cos(\Omega\sin\lambda\ t)\,\cos\left(\sqrt{\frac{g}{l}}\ t\right), \tag{7.46}$$
$$y' = -a\,\sin(\Omega\sin\lambda\ t)\,\cos\left(\sqrt{\frac{g}{l}}\ t\right). \tag{7.47}$$

These equations describe sinusoidal oscillations, in a plane whose normal is parallel to the
Earth's surface, at the standard pendulum frequency $\sqrt{g/l}$. The Coriolis force, however,
causes the plane of oscillation to slowly precess at the angular frequency $\Omega\sin\lambda$. The
period of the precession is

$$T = \frac{2\pi}{\Omega\sin\lambda} = \frac{24}{\sin\lambda}\ \mathrm{hrs}. \tag{7.48}$$

For example, according to the above equations, the pendulum oscillates in the $x'$-direction
(i.e., north/south) at $t \simeq 0$, in the $y'$-direction (i.e., east/west) at $t \simeq T/4$, in the
$x'$-direction again at $t \simeq T/2$, etc. The precession is clockwise (looking from above) in the
northern hemisphere, and counter-clockwise in the southern hemisphere.

The precession of the plane of oscillation of a pendulum, due to the Coriolis force, is 
used in many museums and observatories to demonstrate that the Earth is rotating. This 
method of making the Earth's rotation manifest was first devised by Foucault in 1851. 
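
As a quick numerical illustration of Equation (7.48), the following sketch tabulates the precession period at a few latitudes. It assumes the 24-hour rotation period used in Equation (7.10); the function name is hypothetical.

```python
import math


def foucault_period_hours(lat_deg):
    """Precession period of a Foucault pendulum from Equation (7.48), in hours."""
    s = abs(math.sin(math.radians(lat_deg)))
    if s == 0.0:
        return math.inf      # no precession at the equator
    return 24.0 / s


for lat in (90.0, 60.0, 30.0, 0.0):
    print(f"latitude {lat:4.1f} deg: period = {foucault_period_hours(lat):.1f} hrs")
```

The period is shortest at the poles and grows without bound toward the equator, where the plane of oscillation does not precess at all.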

7.6 Exercises 

7.1. A pebble is dropped down an elevator shaft in the Empire State Building (h = 1250 ft, 
latitude = 41° N). Find the pebble's horizontal deflection (magnitude and direction) due to 
the Coriolis force at the bottom of the shaft. Neglect air resistance. 

7.2. If a bullet is fired due east, at an elevation angle $\alpha$, from a point on the Earth whose latitude
is $+\lambda$, show that it will strike the Earth with a lateral deflection $4\,\Omega\,v_0^3\,\sin\lambda\,\sin^2\alpha\,\cos\alpha/g^2$.
Is the deflection northward or southward? Here, $\Omega$ is the Earth's angular velocity, $v_0$ is the
bullet's initial speed, and $g$ is the acceleration due to gravity. Neglect air resistance.

7.3. A particle is thrown vertically with initial speed $v_0$, reaches a maximum height, and falls back
to the ground. Show that the horizontal Coriolis deflection of the particle when it returns to
the ground is opposite in direction, and four times greater in magnitude, than the Coriolis
deflection when it is dropped at rest from the same maximum height. Neglect air resistance.

7.4. The surface of the Diskworld is a disk which rotates (counter-clockwise looking down) with
angular frequency $\Omega$ about a perpendicular axis passing through its center. Diskworld
gravitational acceleration is of magnitude $g$, and is everywhere directed normal to the disk. A
projectile is launched from the surface of the disk at a point whose radial distance from the
axis of rotation is $R$. The initial velocity of the projectile (in a co-rotating frame) is of
magnitude $v_0$, is directed radially outwards, and is inclined at an angle $\alpha$ to the horizontal. What
are the radial and tangential displacements of the impact point from that calculated by
neglecting the centrifugal and Coriolis forces? Neglect air resistance. You may assume that the
displacements are small compared to both $R$ and the horizontal range of the projectile.

7.5. Demonstrate that the Coriolis force causes conical pendulums to rotate clockwise and counter- 
clockwise with slightly different angular frequencies. What is the frequency difference as a 
function of terrestrial latitude? 

7.6. A satellite is in a circular orbit of radius $a$ about the Earth. Let us define a set of co-moving
Cartesian coordinates, centered on the satellite, such that the $x$-axis always points toward the
center of the Earth, the $y$-axis in the direction of the satellite's orbital motion, and the $z$-axis
in the direction of the satellite's orbital angular velocity, $\boldsymbol{\omega}$. Demonstrate that the equations
of motion of a small mass in orbit about the satellite are

$$\ddot{x} = 3\,\omega^2\,x + 2\,\omega\,\dot{y},$$
$$\ddot{y} = -2\,\omega\,\dot{x},$$

assuming that $|x|/a \ll 1$ and $|y|/a \ll 1$. You may neglect the gravitational attraction between
the satellite and the mass. Show that the mass executes a retrograde (i.e., in the opposite
sense to the satellite's orbital rotation) elliptical orbit about the satellite whose period
matches that of the satellite's orbit, and whose major and minor axes are in the ratio 2:1,
and are aligned along the $y$- and $x$-axes, respectively.

8 Rigid Body Rotation



8.1 Introduction 

This chapter examines the rotation of rigid bodies in three dimensions. 

8.2 Fundamental Equations 

We can think of a rigid body as a collection of a large number of small mass elements
which all maintain a fixed spatial relationship with respect to one another. Let there be $N$
elements, and let the $i$th element be of mass $m_i$, and instantaneous position vector $\mathbf{r}_i$. The
equation of motion of the $i$th element is written

$$m_i\,\frac{d^2\mathbf{r}_i}{dt^2} = \sum_{j=1,N}^{j\neq i}\mathbf{f}_{ij} + \mathbf{F}_i. \tag{8.1}$$

Here, $\mathbf{f}_{ij}$ is the internal force exerted on the $i$th element by the $j$th element, and $\mathbf{F}_i$ the
external force acting on the $i$th element. The internal forces $\mathbf{f}_{ij}$ represent the stresses
which develop within the body in order to ensure that its various elements maintain a
constant spatial relationship with respect to one another. Of course, $\mathbf{f}_{ij} = -\mathbf{f}_{ji}$, by Newton's
third law. The external forces represent forces which originate outside the body.

Repeating the analysis of Section 2.7, we can sum Equation (8.1) over all mass elements
to obtain

$$M\,\frac{d^2\mathbf{r}_{cm}}{dt^2} = \mathbf{F}. \tag{8.2}$$

Here, $M = \sum_{i=1,N} m_i$ is the total mass, $\mathbf{r}_{cm}$ the position vector of the center of mass [see
Equation (2.27)], and $\mathbf{F} = \sum_{i=1,N}\mathbf{F}_i$ the total external force. It can be seen that the center
of mass of a rigid body moves under the action of the external forces like a point particle
whose mass is identical with that of the body.

Again repeating the analysis of Section 2.7, we can sum $\mathbf{r}_i\times$ Equation (8.1) over all
mass elements to obtain

$$\frac{d\mathbf{L}}{dt} = \mathbf{T}. \tag{8.3}$$

Here, $\mathbf{L} = \sum_{i=1,N} m_i\,\mathbf{r}_i\times d\mathbf{r}_i/dt$ is the total angular momentum of the body (about the
origin), and $\mathbf{T} = \sum_{i=1,N}\mathbf{r}_i\times\mathbf{F}_i$ the total external torque (about the origin). Note that the
above equation is only valid if the internal forces are central in nature. However, this is not
a particularly onerous constraint. Equation (8.3) describes how the angular momentum of
a rigid body evolves in time under the action of the external torques.

In the following, we shall only consider the rotational motion of rigid bodies, since 
their translational motion is similar to that of point particles [see Equation (8.2)], and, 
therefore, fairly straightforward in nature.

8.3 Moment of Inertia Tensor

Consider a rigid body rotating with fixed angular velocity $\boldsymbol{\omega}$ about an axis which passes
through the origin (see Figure 8.1). Let $\mathbf{r}_i$ be the position vector of the $i$th mass
element, whose mass is $m_i$. We expect this position vector to precess about the axis of
rotation (which is parallel to $\boldsymbol{\omega}$) with angular velocity $\boldsymbol{\omega}$. It, therefore, follows from
Equation (A.49) that

$$\frac{d\mathbf{r}_i}{dt} = \boldsymbol{\omega}\times\mathbf{r}_i. \tag{8.4}$$

Thus, the above equation specifies the velocity, $\mathbf{v}_i = d\mathbf{r}_i/dt$, of each mass element as the
body rotates with fixed angular velocity $\boldsymbol{\omega}$ about an axis passing through the origin.
The total angular momentum of the body (about the origin) is written

$$\mathbf{L} = \sum_{i=1,N} m_i\,\mathbf{r}_i\times\frac{d\mathbf{r}_i}{dt} = \sum_{i=1,N} m_i\,\mathbf{r}_i\times(\boldsymbol{\omega}\times\mathbf{r}_i) = \sum_{i=1,N} m_i\left[r_i^2\,\boldsymbol{\omega} - (\mathbf{r}_i\cdot\boldsymbol{\omega})\,\mathbf{r}_i\right], \tag{8.5}$$

where use has been made of Equation (8.4), and some standard vector identities (see
Section A.10). The above formula can be written as a matrix equation of the form

$$\begin{pmatrix} L_x \\ L_y \\ L_z \end{pmatrix} = \begin{pmatrix} I_{xx} & I_{xy} & I_{xz} \\ I_{yx} & I_{yy} & I_{yz} \\ I_{zx} & I_{zy} & I_{zz} \end{pmatrix} \begin{pmatrix} \omega_x \\ \omega_y \\ \omega_z \end{pmatrix}, \tag{8.6}$$

where

$$I_{xx} = \sum_{i=1,N} (y_i^2 + z_i^2)\,m_i = \int (y^2 + z^2)\,dm, \tag{8.7}$$
$$I_{yy} = \sum_{i=1,N} (x_i^2 + z_i^2)\,m_i = \int (x^2 + z^2)\,dm, \tag{8.8}$$
$$I_{zz} = \sum_{i=1,N} (x_i^2 + y_i^2)\,m_i = \int (x^2 + y^2)\,dm, \tag{8.9}$$
$$I_{xy} = I_{yx} = -\sum_{i=1,N} x_i\,y_i\,m_i = -\int x\,y\,dm, \tag{8.10}$$
$$I_{yz} = I_{zy} = -\sum_{i=1,N} y_i\,z_i\,m_i = -\int y\,z\,dm, \tag{8.11}$$
$$I_{xz} = I_{zx} = -\sum_{i=1,N} x_i\,z_i\,m_i = -\int x\,z\,dm. \tag{8.12}$$



Here, $I_{xx}$ is called the moment of inertia about the $x$-axis, $I_{yy}$ the moment of inertia about
the $y$-axis, $I_{xy}$ the $xy$ product of inertia, $I_{yz}$ the $yz$ product of inertia, etc. The matrix of
the $I_{ij}$ values is known as the moment of inertia tensor.[1] Note that each component of the
moment of inertia tensor can be written as either a sum over separate mass elements, or as
an integral over infinitesimal mass elements. In the integrals, $dm = \rho\,dV$, where $\rho$ is the
mass density, and $dV$ a volume element. Equation (8.6) can be written more succinctly as

$$\mathbf{L} = \mathbf{I}\,\boldsymbol{\omega}. \tag{8.13}$$

Here, it is understood that $\mathbf{L}$ and $\boldsymbol{\omega}$ are both column vectors, and $\mathbf{I}$ is the matrix of the $I_{ij}$
values. Note that $\mathbf{I}$ is a real symmetric matrix: i.e., $I_{ij}^{\,*} = I_{ij}$ and $I_{ji} = I_{ij}$.

In general, the angular momentum vector, $\mathbf{L}$, obtained from Equation (8.13), points in
a different direction to the angular velocity vector, $\boldsymbol{\omega}$. In other words, $\mathbf{L}$ is generally not
parallel to $\boldsymbol{\omega}$.

Finally, although the above results were obtained assuming a fixed angular velocity, 
they remain valid at each instant in time if the angular velocity varies. 
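
To make the construction concrete, the following Python sketch assembles the moment of inertia tensor of a small set of point masses from the sums in Equations (8.7)-(8.12), and then evaluates Equation (8.13). The function names and the two-mass test configuration are hypothetical choices made for illustration.

```python
def inertia_tensor(masses, positions):
    """Moment of inertia tensor (3x3 nested lists) of point masses about the origin.

    Uses I_ab = sum_i m_i (r_i^2 delta_ab - r_ia r_ib), which reproduces
    the component sums in Equations (8.7)-(8.12).
    """
    tensor = [[0.0] * 3 for _ in range(3)]
    for m, r in zip(masses, positions):
        r2 = r[0] * r[0] + r[1] * r[1] + r[2] * r[2]
        for a in range(3):
            for b in range(3):
                delta = 1.0 if a == b else 0.0
                tensor[a][b] += m * (r2 * delta - r[a] * r[b])
    return tensor


def angular_momentum(tensor, omega):
    """Angular momentum L = I omega, Equation (8.13)."""
    return tuple(sum(tensor[a][b] * omega[b] for b in range(3)) for a in range(3))


masses = [1.0, 2.0]
positions = [(1.0, 1.0, 0.0), (0.0, 1.0, 2.0)]
I = inertia_tensor(masses, positions)
L = angular_momentum(I, (0.0, 0.0, 1.0))
print(I)
print(L)
```

For this configuration, rotation about the z-axis produces an angular momentum with a non-zero y-component, illustrating that L need not be parallel to the angular velocity.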



8.4 Rotational Kinetic Energy 

The instantaneous rotational kinetic energy of a rotating rigid body is written

$$K = \frac{1}{2}\sum_{i=1,N} m_i\left(\frac{d\mathbf{r}_i}{dt}\right)^2. \tag{8.14}$$

Making use of Equation (8.4), and some vector identities (see Section A.9), the kinetic
energy takes the form

$$K = \frac{1}{2}\sum_{i=1,N} m_i\,(\boldsymbol{\omega}\times\mathbf{r}_i)\cdot(\boldsymbol{\omega}\times\mathbf{r}_i) = \frac{1}{2}\,\boldsymbol{\omega}\cdot\sum_{i=1,N} m_i\,\mathbf{r}_i\times(\boldsymbol{\omega}\times\mathbf{r}_i). \tag{8.15}$$

Hence, it follows from (8.5) that

$$K = \frac{1}{2}\,\boldsymbol{\omega}\cdot\mathbf{L}. \tag{8.16}$$

Making use of Equation (8.13), we can also write

$$K = \frac{1}{2}\,\boldsymbol{\omega}^T\,\mathbf{I}\,\boldsymbol{\omega}. \tag{8.17}$$

Here, $\boldsymbol{\omega}^T$ is the row vector of the Cartesian components $\omega_x$, $\omega_y$, $\omega_z$, which is, of course,
the transpose (denoted $^T$) of the column vector $\boldsymbol{\omega}$. When written in component form, the
above equation yields

$$K = \frac{1}{2}\left(I_{xx}\,\omega_x^2 + I_{yy}\,\omega_y^2 + I_{zz}\,\omega_z^2 + 2\,I_{xy}\,\omega_x\,\omega_y + 2\,I_{yz}\,\omega_y\,\omega_z + 2\,I_{xz}\,\omega_x\,\omega_z\right). \tag{8.18}$$

[1] A tensor is the two-dimensional generalization of a vector. However, for present purposes, we can simply
think of a tensor as another name for a matrix.

8.5 Matrix Eigenvalue Theory

It is time to review a little matrix theory. Suppose that $\mathbf{A}$ is a real symmetric matrix of
dimension $n$. It follows that $\mathbf{A}^* = \mathbf{A}$ and $\mathbf{A}^T = \mathbf{A}$, where $^*$ denotes a complex conjugate,
and $^T$ denotes a transpose. Consider the matrix equation

$$\mathbf{A}\,\mathbf{x} = \lambda\,\mathbf{x}. \tag{8.19}$$

Any column vector $\mathbf{x}$ which satisfies the above equation is called an eigenvector of $\mathbf{A}$.
Likewise, the associated number $\lambda$ is called an eigenvalue of $\mathbf{A}$. Let us investigate the properties
of the eigenvectors and eigenvalues of a real symmetric matrix.

Equation (8.19) can be rearranged to give

$$(\mathbf{A} - \lambda\,\mathbf{1})\,\mathbf{x} = \mathbf{0}, \tag{8.20}$$

where $\mathbf{1}$ is the unit matrix. The above matrix equation is essentially a set of $n$ homogeneous
simultaneous algebraic equations for the $n$ components of $\mathbf{x}$. A well-known property of
such a set of equations is that it only has a non-trivial solution when the determinant of
the associated matrix is set to zero. Hence, a necessary condition for the above set of
equations to have a non-trivial solution is that

$$|\mathbf{A} - \lambda\,\mathbf{1}| = 0. \tag{8.21}$$

The above formula is essentially an $n$th-order polynomial equation for $\lambda$. We know that
such an equation has $n$ (possibly complex) roots. Hence, we conclude that there are $n$
eigenvalues, and $n$ associated eigenvectors, of the $n$-dimensional matrix $\mathbf{A}$.

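Let us now demonstrate that the $n$ eigenvalues and eigenvectors of the real symmetric
matrix $\mathbf{A}$ are all real. We have

$$\mathbf{A}\,\mathbf{x}_i = \lambda_i\,\mathbf{x}_i, \tag{8.22}$$

and, taking the transpose and complex conjugate,

$$\mathbf{x}_i^{*T}\,\mathbf{A} = \lambda_i^*\,\mathbf{x}_i^{*T}, \tag{8.23}$$

where $\mathbf{x}_i$ and $\lambda_i$ are the $i$th eigenvector and eigenvalue of $\mathbf{A}$, respectively. Left multiplying
Equation (8.22) by $\mathbf{x}_i^{*T}$, we obtain

$$\mathbf{x}_i^{*T}\,\mathbf{A}\,\mathbf{x}_i = \lambda_i\,\mathbf{x}_i^{*T}\,\mathbf{x}_i. \tag{8.24}$$

Likewise, right multiplying (8.23) by $\mathbf{x}_i$, we get

$$\mathbf{x}_i^{*T}\,\mathbf{A}\,\mathbf{x}_i = \lambda_i^*\,\mathbf{x}_i^{*T}\,\mathbf{x}_i. \tag{8.25}$$

The difference of the previous two equations yields

$$(\lambda_i - \lambda_i^*)\,\mathbf{x}_i^{*T}\,\mathbf{x}_i = 0. \tag{8.26}$$

It follows that $\lambda_i = \lambda_i^*$, since $\mathbf{x}_i^{*T}\,\mathbf{x}_i$ (which is $\mathbf{x}_i^*\cdot\mathbf{x}_i$ in vector notation) is positive definite.
Hence, $\lambda_i$ is real. It immediately follows that $\mathbf{x}_i$ is real.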

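Next, let us show that two eigenvectors corresponding to two different eigenvalues are
mutually orthogonal. Let

$$\mathbf{A}\,\mathbf{x}_i = \lambda_i\,\mathbf{x}_i, \tag{8.27}$$
$$\mathbf{A}\,\mathbf{x}_j = \lambda_j\,\mathbf{x}_j, \tag{8.28}$$

where $\lambda_i \neq \lambda_j$. Taking the transpose of the first equation and right multiplying by $\mathbf{x}_j$, and
left multiplying the second equation by $\mathbf{x}_i^T$, we obtain

$$\mathbf{x}_i^T\,\mathbf{A}\,\mathbf{x}_j = \lambda_i\,\mathbf{x}_i^T\,\mathbf{x}_j, \tag{8.29}$$
$$\mathbf{x}_i^T\,\mathbf{A}\,\mathbf{x}_j = \lambda_j\,\mathbf{x}_i^T\,\mathbf{x}_j. \tag{8.30}$$

Taking the difference of the above two equations, we get

$$(\lambda_i - \lambda_j)\,\mathbf{x}_i^T\,\mathbf{x}_j = 0. \tag{8.31}$$

Since, by hypothesis, $\lambda_i \neq \lambda_j$, it follows that $\mathbf{x}_i^T\,\mathbf{x}_j = 0$. In vector notation, this is the same
as $\mathbf{x}_i\cdot\mathbf{x}_j = 0$. Hence, the eigenvectors $\mathbf{x}_i$ and $\mathbf{x}_j$ are mutually orthogonal.

Suppose that $\lambda_i = \lambda_j = \lambda$. In this case, we cannot conclude that $\mathbf{x}_i^T\,\mathbf{x}_j = 0$ by the
above argument. However, it is easily seen that any linear combination of $\mathbf{x}_i$ and $\mathbf{x}_j$ is an
eigenvector of $\mathbf{A}$ with eigenvalue $\lambda$. Hence, it is possible to define two new eigenvectors of
$\mathbf{A}$, with the eigenvalue $\lambda$, which are mutually orthogonal. For instance,

$$\mathbf{x}_i' = \mathbf{x}_i, \tag{8.32}$$
$$\mathbf{x}_j' = \mathbf{x}_j - \left(\frac{\mathbf{x}_i^T\,\mathbf{x}_j}{\mathbf{x}_i^T\,\mathbf{x}_i}\right)\mathbf{x}_i. \tag{8.33}$$

It should be clear that this argument can be generalized to deal with any number of
eigenvalues which take the same value.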

In conclusion, a real symmetric n-dimensional matrix possesses n real eigenvalues, with 
n associated real eigenvectors, which are, or can be chosen to be, mutually orthogonal. 
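
The two properties just derived can be checked explicitly for the simplest non-trivial case, a 2×2 real symmetric matrix, for which Equation (8.21) is a quadratic with a closed-form solution. The following sketch is an illustration only; the function name `sym2_eig` and the sample matrix are hypothetical.

```python
import math


def sym2_eig(a, b, d):
    """Eigenpairs of the real symmetric matrix [[a, b], [b, d]].

    Returns [(lam1, x1), (lam2, x2)] with lam1 <= lam2 and unit eigenvectors.
    """
    mean = 0.5 * (a + d)
    rad = math.hypot(0.5 * (a - d), b)    # real for any real a, b, d
    lams = (mean - rad, mean + rad)
    if b == 0.0:
        # Already diagonal: the coordinate axes are eigenvectors.
        vecs = [(1.0, 0.0), (0.0, 1.0)] if a <= d else [(0.0, 1.0), (1.0, 0.0)]
    else:
        vecs = []
        for lam in lams:
            x = (b, lam - a)              # solves the first row of (A - lam 1) x = 0
            norm = math.hypot(x[0], x[1])
            vecs.append((x[0] / norm, x[1] / norm))
    return list(zip(lams, vecs))


(lam1, x1), (lam2, x2) = sym2_eig(2.0, 1.0, 2.0)
dot = x1[0] * x2[0] + x1[1] * x2[1]
print(lam1, lam2, dot)
```

The quantity under the square root is a sum of squares, so both roots are real, and the dot product of the two eigenvectors vanishes to rounding error, in line with the general result.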






8.6 Principal Axes of Rotation 

We have seen that the moment of inertia tensor, $\mathbf{I}$, defined in Section 8.3, takes the form
of a real symmetric three-dimensional matrix. It therefore follows, from the matrix theory
that we have just reviewed, that the moment of inertia tensor possesses three mutually
orthogonal eigenvectors which are associated with three real eigenvalues. Let the $i$th
eigenvector (which can be normalized to be a unit vector) be denoted $\hat{\boldsymbol{\omega}}_i$, and the $i$th eigenvalue
$\lambda_i$. It then follows that

$$\mathbf{I}\,\hat{\boldsymbol{\omega}}_i = \lambda_i\,\hat{\boldsymbol{\omega}}_i, \tag{8.34}$$

for $i = 1, 2, 3$.

The directions of the three mutually orthogonal unit vectors $\hat{\boldsymbol{\omega}}_i$ define the three
so-called principal axes of rotation of the rigid body under investigation. These axes are
special because when the body rotates about one of them (i.e., when $\boldsymbol{\omega}$ is parallel to one
of them) the angular momentum vector $\mathbf{L}$ becomes parallel to the angular velocity vector
$\boldsymbol{\omega}$. This can be seen from a comparison of Equation (8.13) and Equation (8.34).

Suppose that we reorient our Cartesian coordinate axes so that they coincide with the
mutually orthogonal principal axes of rotation. In this new reference frame, the
eigenvectors of $\mathbf{I}$ are the unit vectors, $\mathbf{e}_x$, $\mathbf{e}_y$, and $\mathbf{e}_z$, and the eigenvalues are the moments of inertia
about these axes, $I_{xx}$, $I_{yy}$, and $I_{zz}$, respectively. These latter quantities are referred to as
the principal moments of inertia. Note that the products of inertia are all zero in the new
reference frame. Hence, in this frame, the moment of inertia tensor takes the form of a
diagonal matrix: i.e.,

$$\mathbf{I} = \begin{pmatrix} I_{xx} & 0 & 0 \\ 0 & I_{yy} & 0 \\ 0 & 0 & I_{zz} \end{pmatrix}. \tag{8.35}$$

Incidentally, it is easy to verify that $\mathbf{e}_x$, $\mathbf{e}_y$, and $\mathbf{e}_z$ are indeed the eigenvectors of the above
matrix, with the eigenvalues $I_{xx}$, $I_{yy}$, and $I_{zz}$, respectively, and that $\mathbf{L} = \mathbf{I}\,\boldsymbol{\omega}$ is indeed
parallel to $\boldsymbol{\omega}$ whenever $\boldsymbol{\omega}$ is directed along $\mathbf{e}_x$, $\mathbf{e}_y$, or $\mathbf{e}_z$.

When expressed in our new coordinate system, Equation (8.13) yields

$$\mathbf{L} = (I_{xx}\,\omega_x,\ I_{yy}\,\omega_y,\ I_{zz}\,\omega_z), \tag{8.36}$$

whereas Equation (8.18) reduces to

$$K = \frac{1}{2}\left(I_{xx}\,\omega_x^2 + I_{yy}\,\omega_y^2 + I_{zz}\,\omega_z^2\right). \tag{8.37}$$

In conclusion, there are many great simplifications to be had by choosing a coordinate 
system whose axes coincide with the principal axes of rotation of the rigid body under 
investigation. But how do we determine the directions of the principal axes in practice? 

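Well, in general, we have to solve the eigenvalue equation

$$\mathbf{I}\,\hat{\boldsymbol{\omega}} = \lambda\,\hat{\boldsymbol{\omega}}, \tag{8.38}$$

or

$$\begin{pmatrix} I_{xx} - \lambda & I_{xy} & I_{xz} \\ I_{yx} & I_{yy} - \lambda & I_{yz} \\ I_{zx} & I_{zy} & I_{zz} - \lambda \end{pmatrix} \begin{pmatrix} \cos\alpha \\ \cos\beta \\ \cos\gamma \end{pmatrix} = \begin{pmatrix} 0 \\ 0 \\ 0 \end{pmatrix}, \tag{8.39}$$

where $\hat{\boldsymbol{\omega}} = (\cos\alpha,\ \cos\beta,\ \cos\gamma)$, and $\cos^2\alpha + \cos^2\beta + \cos^2\gamma = 1$. Here, $\alpha$ is the angle
the unit eigenvector subtends with the $x$-axis, $\beta$ the angle it subtends with the $y$-axis, and
$\gamma$ the angle it subtends with the $z$-axis. Unfortunately, the analytic solution of the above
matrix equation is generally quite difficult.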

Fortunately, however, in many instances the rigid body under investigation possesses 
some kind of symmetry, so that at least one principal axis can be found by inspection. In 
this case, the other two principal axes can be determined as follows. 

Suppose that the $z$-axis is known to be a principal axis (at the origin) in some
coordinate system. It follows that the two products of inertia $I_{xz}$ and $I_{yz}$ are zero
[otherwise, $(0, 0, 1)$ would not be an eigenvector in Equation (8.39)]. The other two principal
axes must lie in the $x$-$y$ plane: i.e., $\cos\gamma = 0$. It then follows that $\cos\beta = \sin\alpha$, since
$\cos^2\alpha + \cos^2\beta + \cos^2\gamma = 1$. The first two rows in the matrix equation (8.39) thus reduce
to

$$(I_{xx} - \lambda)\cos\alpha + I_{xy}\sin\alpha = 0, \tag{8.40}$$
$$I_{xy}\cos\alpha + (I_{yy} - \lambda)\sin\alpha = 0. \tag{8.41}$$

Eliminating $\lambda$ between the above two equations, we obtain

$$I_{xy}\,(1 - \tan^2\alpha) = (I_{xx} - I_{yy})\tan\alpha. \tag{8.42}$$

But, $\tan(2\alpha) = 2\tan\alpha/(1 - \tan^2\alpha)$. Hence, Equation (8.42) yields

$$\tan(2\alpha) = \frac{2\,I_{xy}}{I_{xx} - I_{yy}}. \tag{8.43}$$

There are two values of $\alpha$, lying between $-\pi/2$ and $\pi/2$, which satisfy the above equation.
These specify the angles, $\alpha$, that the two mutually orthogonal principal axes lying in the
$x$-$y$ plane make with the $x$-axis. Hence, we have now determined the directions of all three
principal axes. Incidentally, once we have determined the orientation angle, $\alpha$, of a
principal axis, we can then substitute back into Equation (8.40) to obtain the corresponding
principal moment of inertia, $\lambda$.

As an example, consider a uniform rectangular lamina of mass $m$ and sides $a$ and $b$
which lies in the $x$-$y$ plane, as shown in Figure 8.2. Suppose that the axis of rotation
passes through the origin (i.e., through a corner of the lamina). Since $z = 0$ throughout
the lamina, it follows from Equations (8.11) and (8.12) that $I_{xz} = I_{yz} = 0$. Hence, the
$z$-axis is a principal axis. After some straightforward integration, Equations (8.7)-(8.10)
yield

$$I_{xx} = \frac{1}{3}\,m\,b^2, \tag{8.44}$$
$$I_{yy} = \frac{1}{3}\,m\,a^2, \tag{8.45}$$
$$I_{xy} = -\frac{1}{4}\,m\,a\,b. \tag{8.46}$$

Figure 8.2: A uniform rectangular lamina.

Thus, it follows from Equation (8.43) that

$$\alpha = \frac{1}{2}\tan^{-1}\left(\frac{3}{2}\,\frac{a\,b}{a^2 - b^2}\right). \tag{8.47}$$

The above equation specifies the orientation of the two principal axes that lie in the $x$-$y$
plane. For the special case where $a = b$, we get $\alpha = \pi/4,\ 3\pi/4$: i.e., the two in-plane
principal axes of a square lamina (at a corner) are parallel to the two diagonals of the
lamina.
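
The lamina example can be worked through numerically. The sketch below evaluates Equations (8.44)-(8.46), obtains one in-plane axis angle from Equation (8.43), and takes the second axis to be perpendicular to the first. Two choices here are assumptions made for illustration: `atan2` is used so that the case $a = b$ (where the denominator of (8.43) vanishes) is handled without division, and each principal moment is evaluated as the quadratic form $\hat{\mathbf{n}}^T\,\mathbf{I}\,\hat{\mathbf{n}}$ along the axis direction $\hat{\mathbf{n}} = (\cos\alpha, \sin\alpha, 0)$ rather than by back-substitution into (8.40).

```python
import math


def lamina_principal_axes(m, a, b):
    """In-plane principal axes of a uniform a-by-b lamina about a corner.

    Returns a list of two (alpha, moment) pairs, where alpha is the angle
    (in radians) that the axis makes with the x-axis.
    """
    I_xx = m * b ** 2 / 3.0          # Equation (8.44)
    I_yy = m * a ** 2 / 3.0          # Equation (8.45)
    I_xy = -m * a * b / 4.0          # Equation (8.46)
    alpha = 0.5 * math.atan2(2.0 * I_xy, I_xx - I_yy)   # Equation (8.43)
    axes = []
    for ang in (alpha, alpha + 0.5 * math.pi):
        c, s = math.cos(ang), math.sin(ang)
        moment = I_xx * c * c + 2.0 * I_xy * s * c + I_yy * s * s
        axes.append((ang, moment))
    return axes


square = lamina_principal_axes(1.0, 1.0, 1.0)
for ang, moment in square:
    print(f"alpha = {math.degrees(ang):+.1f} deg, principal moment = {moment:.4f}")
```

For the unit square the sketch returns angles of −45° and +45°; an axis at −45° is the same line as one at 3π/4, so this agrees with the directions quoted above.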



8.7 Euler's Equations 

The fundamental equation of motion of a rotating body [see Equation (8.3)], 

dL 

T = — , (8.48) 
at 

is only valid in an inertial frame. However, we have seen that L is most simply expressed 
in a frame of reference whose axes are aligned along the principal axes of rotation of the 
body. Such a frame of reference rotates with the body, and is, therefore, non-inertial. Thus, 
it is helpful to define two Cartesian coordinate systems, with the same origins. The first, 
with coordinates x, y, z, is a fixed inertial frame — let us denote this the fixed frame. The 
second, with coordinates x', y', z', co-rotates with the body in such a manner that the x'-,
y'-, and z'-axes are always pointing along its principal axes of rotation; we shall refer to
this as the body frame. Since the body frame co-rotates with the body, its instantaneous 
angular velocity is the same as that of the body. Hence, it follows from the analysis in 
Section 7.2 that 

$$\frac{d\mathbf{L}}{dt} = \frac{d\mathbf{L}}{dt'} + \boldsymbol{\omega} \times \mathbf{L}. \tag{8.49}$$

Here, d/dt is the time derivative in the fixed frame, and d/dt' the time derivative in the 
body frame. Combining Equations (8.48) and (8.49), we obtain 

$$\mathbf{T} = \frac{d\mathbf{L}}{dt'} + \boldsymbol{\omega} \times \mathbf{L}. \tag{8.50}$$

Now, in the body frame let $\mathbf{T} = (T_{x'}, T_{y'}, T_{z'})$ and $\boldsymbol{\omega} = (\omega_{x'}, \omega_{y'}, \omega_{z'})$. It follows that
$\mathbf{L} = (I_{x'x'}\,\omega_{x'},\ I_{y'y'}\,\omega_{y'},\ I_{z'z'}\,\omega_{z'})$, where $I_{x'x'}$, $I_{y'y'}$, and $I_{z'z'}$ are the principal moments of
inertia. Hence, in the body frame, the components of Equation (8.50) yield

$$T_{x'} = I_{x'x'}\,\dot{\omega}_{x'} - (I_{y'y'} - I_{z'z'})\,\omega_{y'}\,\omega_{z'}, \tag{8.51}$$

$$T_{y'} = I_{y'y'}\,\dot{\omega}_{y'} - (I_{z'z'} - I_{x'x'})\,\omega_{z'}\,\omega_{x'}, \tag{8.52}$$

$$T_{z'} = I_{z'z'}\,\dot{\omega}_{z'} - (I_{x'x'} - I_{y'y'})\,\omega_{x'}\,\omega_{y'}, \tag{8.53}$$

where an overdot denotes $d/dt'$. Here, we have made use of the fact that the moments of inertia of a rigid
body are constant in time in the co-rotating body frame. The above equations are known 
as Euler's equations. 

Consider a rigid body which is constrained to rotate about a fixed axis with constant 
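angular velocity. It follows that $\dot{\omega}_{x'} = \dot{\omega}_{y'} = \dot{\omega}_{z'} = 0$. Hence, Euler's equations,
(8.51)-(8.53), reduce to

$$T_{x'} = -(I_{y'y'} - I_{z'z'})\,\omega_{y'}\,\omega_{z'}, \tag{8.54}$$

$$T_{y'} = -(I_{z'z'} - I_{x'x'})\,\omega_{z'}\,\omega_{x'}, \tag{8.55}$$

$$T_{z'} = -(I_{x'x'} - I_{y'y'})\,\omega_{x'}\,\omega_{y'}. \tag{8.56}$$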

These equations specify the components of the steady (in the body frame) torque exerted 
on the body by the constraining supports. The steady (in the body frame) angular momen- 
tum is written 

$$\mathbf{L} = (I_{x'x'}\,\omega_{x'},\ I_{y'y'}\,\omega_{y'},\ I_{z'z'}\,\omega_{z'}). \tag{8.57}$$

It is easily demonstrated that $\mathbf{T} = \boldsymbol{\omega} \times \mathbf{L}$. Hence, the torque is perpendicular to both the
angular velocity and the angular momentum vectors. Note that if the axis of rotation is
a principal axis then two of the three components of $\boldsymbol{\omega}$ are zero (in the body frame). It
follows from Equations (8.54)-(8.56) that all three components of the torque are zero.
In other words, zero external torque is required to make the body rotate steadily about a 
principal axis. 
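
The relation $\mathbf{T} = \boldsymbol{\omega} \times \mathbf{L}$ for steady constrained rotation is easy to evaluate directly. The sketch below is illustrative only: the helper names `cross`, `dot`, and `steady_torque` are hypothetical, and the inputs are assumed to be body-frame components along the principal axes.

```python
def cross(a, b):
    """Cross product of two 3-component vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dot(a, b):
    """Scalar product of two 3-component vectors."""
    return sum(x * y for x, y in zip(a, b))

def steady_torque(moments, omega):
    """Return (T, L) in the body frame for steady rotation at angular velocity omega.

    moments: principal moments (I_x'x', I_y'y', I_z'z')
    omega:   body-frame components of the angular velocity
    """
    ang_mom = tuple(i * w for i, w in zip(moments, omega))  # Equation (8.57)
    return cross(omega, ang_mom), ang_mom                   # T = omega x L

torque, ang_mom = steady_torque((1.0, 2.0, 3.0), (0.3, -0.4, 1.2))
print("torque:", torque)
print("T . omega:", dot(torque, (0.3, -0.4, 1.2)), " T . L:", dot(torque, ang_mom))
```

Passing an angular velocity directed along a single principal axis, such as `(0.0, 0.0, 2.0)`, returns a zero torque, which is the statement made at the end of the paragraph above.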

Suppose that the body is freely rotating: i.e., there are no external torques. Furthermore,
let the body be rotationally symmetric about the z'-axis. It follows that $I_{x'x'} = I_{y'y'} = I_\perp$.
Likewise, we can write $I_{z'z'} = I_\parallel$. In general, however, $I_\perp \neq I_\parallel$. Thus, Euler's equations
yield

$$I_\perp\, \frac{d\omega_{x'}}{dt'} + (I_\parallel - I_\perp)\,\omega_{z'}\,\omega_{y'} = 0, \tag{8.58}$$

$$I_\perp\, \frac{d\omega_{y'}}{dt'} - (I_\parallel - I_\perp)\,\omega_{z'}\,\omega_{x'} = 0, \tag{8.59}$$

$$\frac{d\omega_{z'}}{dt'} = 0. \tag{8.60}$$

Clearly, $\omega_{z'}$ is a constant of the motion. Equations (8.58) and (8.59) can be written

$$\frac{d\omega_{x'}}{dt'} + \Omega\,\omega_{y'} = 0, \tag{8.61}$$

$$\frac{d\omega_{y'}}{dt'} - \Omega\,\omega_{x'} = 0, \tag{8.62}$$

where $\Omega = (I_\parallel/I_\perp - 1)\,\omega_{z'}$. As is easily demonstrated, the solution to the above equations
is

$$\omega_{x'} = \omega_\perp \cos(\Omega\, t'), \tag{8.63}$$

$$\omega_{y'} = \omega_\perp \sin(\Omega\, t'), \tag{8.64}$$

where $\omega_\perp$ is a constant. Thus, the projection of the angular velocity vector onto the x'-y'
plane has the fixed length $\omega_\perp$, and rotates steadily about the z'-axis with angular velocity
$\Omega$. It follows that the length of the angular velocity vector, $\omega = (\omega_{x'}^2 + \omega_{y'}^2 + \omega_{z'}^2)^{1/2}$, is
a constant of the motion. Clearly, the angular velocity vector makes some constant angle,
$\alpha$, with the z'-axis, which implies that $\omega_{z'} = \omega \cos\alpha$ and $\omega_\perp = \omega \sin\alpha$. Hence, the
components of the angular velocity vector are

$$\omega_{x'} = \omega \sin\alpha\, \cos(\Omega\, t'), \tag{8.65}$$

$$\omega_{y'} = \omega \sin\alpha\, \sin(\Omega\, t'), \tag{8.66}$$

$$\omega_{z'} = \omega \cos\alpha, \tag{8.67}$$

where

$$\Omega = \omega \cos\alpha \left(\frac{I_\parallel}{I_\perp} - 1\right). \tag{8.68}$$

We conclude that, in the body frame, the angular velocity vector precesses about the symmetry
axis (i.e., the z'-axis) with the angular frequency $\Omega$. Now, the components of the
angular momentum vector are

$$L_{x'} = I_\perp\, \omega \sin\alpha\, \cos(\Omega\, t'), \tag{8.69}$$

$$L_{y'} = I_\perp\, \omega \sin\alpha\, \sin(\Omega\, t'), \tag{8.70}$$

$$L_{z'} = I_\parallel\, \omega \cos\alpha. \tag{8.71}$$

Thus, in the body frame, the angular momentum vector is also of constant length, and precesses
about the symmetry axis with the angular frequency $\Omega$. Furthermore, the angular
momentum vector makes a constant angle $\theta$ with the symmetry axis, where

$$\tan\theta = \frac{I_\perp}{I_\parallel}\, \tan\alpha. \tag{8.72}$$

Note that the angular momentum vector, the angular velocity vector, and the symmetry
axis all lie in the same plane: i.e., $\mathbf{e}_{z'} \cdot \mathbf{L} \times \boldsymbol{\omega} = 0$, as can easily be verified. Moreover, the
angular momentum vector lies between the angular velocity vector and the symmetry axis
(i.e., $\theta < \alpha$) for a flattened (or oblate) body (i.e., $I_\perp < I_\parallel$), whereas the angular velocity
vector lies between the angular momentum vector and the symmetry axis (i.e., $\theta > \alpha$) for
an elongated (or prolate) body (i.e., $I_\perp > I_\parallel$).
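
As a quick numerical illustration of Equations (8.65)-(8.68) and (8.72), the sketch below evaluates the body-frame precession rate and the angle between the angular momentum and the symmetry axis. The function names and the sample moments of inertia are hypothetical choices made for this example.

```python
import math

def body_frame_precession(omega, alpha, i_par, i_perp):
    """Return (Omega, theta) for a freely rotating symmetric body.

    omega: magnitude of the angular velocity
    alpha: angle between the angular velocity and the symmetry axis
    """
    big_omega = omega * math.cos(alpha) * (i_par / i_perp - 1.0)  # Equation (8.68)
    theta = math.atan((i_perp / i_par) * math.tan(alpha))        # Equation (8.72)
    return big_omega, theta

def omega_components(omega, alpha, big_omega, t):
    """Body-frame angular velocity components, Equations (8.65)-(8.67)."""
    w_perp = omega * math.sin(alpha)
    return (w_perp * math.cos(big_omega * t),
            w_perp * math.sin(big_omega * t),
            omega * math.cos(alpha))

for label, i_par in (("oblate", 2.0), ("prolate", 0.5)):
    big_omega, theta = body_frame_precession(1.0, 0.3, i_par, 1.0)
    print(f"{label}: Omega = {big_omega:+.4f}, theta = {theta:.4f}, alpha = 0.3000")
```

The oblate case gives $\theta < \alpha$ with positive $\Omega$, and the prolate case gives $\theta > \alpha$ with negative $\Omega$, matching the ordering of the vectors described above.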

Let us now consider the most general motion of a freely rotating asymmetric rigid body, 
as seen in the body frame. Since a freely rotating body experiences no external torques, 
its angular momentum vector L is a constant of the motion in the inertial fixed frame. In 
general, the direction of this vector varies with time in the non-inertial body frame, but its 
length remains fixed. This can be seen from Equation (8.49): if $d\mathbf{L}/dt = 0$ then the scalar
product of this equation with $\mathbf{L}$ implies that $dL^2/dt' = 0$. It follows from Equation (8.57)
that

$$L^2 = I_{x'x'}^2\,\omega_{x'}^2 + I_{y'y'}^2\,\omega_{y'}^2 + I_{z'z'}^2\,\omega_{z'}^2 = \text{constant}. \tag{8.73}$$

The above constraint can also be derived directly from Euler's equations, (8.51)-(8.53),
by setting $T_{x'} = T_{y'} = T_{z'} = 0$. A freely rotating body subject to no external torques clearly
has a constant rotational kinetic energy. Hence, from Equation (8.16),

$$\boldsymbol{\omega} \cdot \mathbf{L} = I_{x'x'}\,\omega_{x'}^2 + I_{y'y'}\,\omega_{y'}^2 + I_{z'z'}\,\omega_{z'}^2 = \text{constant}. \tag{8.74}$$

This constraint can also be derived directly from Euler's equations. We conclude that,
in the body frame, the components of $\boldsymbol{\omega}$ must simultaneously satisfy the two constraints
(8.73) and (8.74). These constraints are the equations of two ellipsoids whose principal
axes coincide with the principal axes of the body, and whose principal radii are in the ratio
$I_{x'x'}^{-1} : I_{y'y'}^{-1} : I_{z'z'}^{-1}$ and $I_{x'x'}^{-1/2} : I_{y'y'}^{-1/2} : I_{z'z'}^{-1/2}$, respectively. In general, the intersection of these
two ellipsoids is a closed curve. Hence, we conclude that the most general motion of a freely
rotating asymmetric body, as seen in the body frame, is a form of irregular precession in
which the tip of the angular velocity vector $\boldsymbol{\omega}$ periodically traces out the aforementioned
closed curve. It is easily demonstrated that the tip of the angular momentum vector $\mathbf{L}$
periodically traces out a different closed curve.
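
The claim that both constraints follow directly from Euler's equations can be checked by evaluating the time derivatives of $L^2$ and $\boldsymbol{\omega}\cdot\mathbf{L}$ with the torque-free form of (8.51)-(8.53). The following is a small sketch under that assumption; `free_euler_rates` and `invariant_rates` are hypothetical names, and the sample moments and angular velocity are arbitrary.

```python
def free_euler_rates(moments, w):
    """Body-frame d(omega)/dt' from Euler's equations with zero torque."""
    ix, iy, iz = moments
    wx, wy, wz = w
    return ((iy - iz) * wy * wz / ix,
            (iz - ix) * wz * wx / iy,
            (ix - iy) * wx * wy / iz)

def invariant_rates(moments, w):
    """Return (d(L^2)/dt', d(omega . L)/dt') for a freely rotating body."""
    wdot = free_euler_rates(moments, w)
    d_l2 = 2.0 * sum(i * i * wi * wd for i, wi, wd in zip(moments, w, wdot))
    d_wl = 2.0 * sum(i * wi * wd for i, wi, wd in zip(moments, w, wdot))
    return d_l2, d_wl

print(invariant_rates((1.0, 2.0, 3.0), (0.3, -1.2, 0.7)))
```

Both rates vanish to rounding error for any choice of moments and angular velocity, since each sum reduces to $\omega_{x'}\omega_{y'}\omega_{z'}$ multiplied by a combination of moments that cancels identically.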

8.8 Eulerian Angles 

We have seen how we can solve Euler's equations to determine the properties of a rotating 
body in the co-rotating body frame. Let us now investigate how we can determine the same 
properties in the inertial fixed frame.

The fixed frame and the body frame share the same origin. Hence, we can transform
from one to the other by means of an appropriate rotation of our coordinate axes. In general,
if we restrict ourselves to rotations about one of the Cartesian axes, three successive
rotations are required to transform the fixed frame into the body frame. There are, in fact,
many different ways to combine three successive rotations in order to achieve this. In the
following, we shall describe the most widely used method, which is due to Euler.

We start in the fixed frame, which has coordinates x, y, z, and unit vectors $\mathbf{e}_x$, $\mathbf{e}_y$, $\mathbf{e}_z$.
Our first rotation is counterclockwise (looking down the axis) through an angle $\phi$ about
the z-axis. The new frame has coordinates x'', y'', z'', and unit vectors $\mathbf{e}_{x''}$, $\mathbf{e}_{y''}$, $\mathbf{e}_{z''}$.
According to Equations (A.17)-(A.19), the transformation of coordinates can be represented
as follows:

$$\begin{pmatrix} x'' \\ y'' \\ z'' \end{pmatrix} = \begin{pmatrix} \cos\phi & \sin\phi & 0 \\ -\sin\phi & \cos\phi & 0 \\ 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} x \\ y \\ z \end{pmatrix}. \tag{8.75}$$

The angular velocity vector associated with $\phi$ has the magnitude $\dot{\phi}$, and is directed along
$\mathbf{e}_z$ (i.e., along the axis of rotation). Hence, we can write

$$\boldsymbol{\omega}_\phi = \dot{\phi}\, \mathbf{e}_z. \tag{8.76}$$

Clearly, $\dot{\phi}$ is the precession rate about the $\mathbf{e}_z$ axis, as seen in the fixed frame.

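The second rotation is counterclockwise (looking down the axis) through an angle $\theta$
about the x''-axis. The new frame has coordinates x''', y''', z''', and unit vectors $\mathbf{e}_{x'''}$,
$\mathbf{e}_{y'''}$, $\mathbf{e}_{z'''}$. By analogy with Equation (8.75), the transformation of coordinates can be
represented as follows:

$$\begin{pmatrix} x''' \\ y''' \\ z''' \end{pmatrix} = \begin{pmatrix} 1 & 0 & 0 \\ 0 & \cos\theta & \sin\theta \\ 0 & -\sin\theta & \cos\theta \end{pmatrix} \begin{pmatrix} x'' \\ y'' \\ z'' \end{pmatrix}. \tag{8.77}$$

The angular velocity vector associated with $\theta$ has the magnitude $\dot{\theta}$, and is directed along
$\mathbf{e}_{x''}$ (i.e., along the axis of rotation). Hence, we can write

$$\boldsymbol{\omega}_\theta = \dot{\theta}\, \mathbf{e}_{x''}. \tag{8.78}$$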



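The third rotation is counterclockwise (looking down the axis) through an angle $\psi$
about the z'''-axis. The new frame is the body frame, which has coordinates x', y', z', and
unit vectors $\mathbf{e}_{x'}$, $\mathbf{e}_{y'}$, $\mathbf{e}_{z'}$. The transformation of coordinates can be represented as follows:

$$\begin{pmatrix} x' \\ y' \\ z' \end{pmatrix} = \begin{pmatrix} \cos\psi & \sin\psi & 0 \\ -\sin\psi & \cos\psi & 0 \\ 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} x''' \\ y''' \\ z''' \end{pmatrix}. \tag{8.79}$$

The angular velocity vector associated with $\psi$ has the magnitude $\dot{\psi}$, and is directed along
$\mathbf{e}_{z'''}$ (i.e., along the axis of rotation). Note that $\mathbf{e}_{z'''} = \mathbf{e}_{z'}$, since the third rotation is about
$\mathbf{e}_{z'''}$. Hence, we can write

$$\boldsymbol{\omega}_\psi = \dot{\psi}\, \mathbf{e}_{z'}. \tag{8.80}$$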






Clearly, $\dot{\psi}$ is minus the precession rate about the $\mathbf{e}_{z'}$ axis, as seen in the body frame.

The full transformation between the fixed frame and the body frame is rather compli- 
cated. However, the following results can easily be verified: 

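$$\mathbf{e}_z = \sin\psi\, \sin\theta\, \mathbf{e}_{x'} + \cos\psi\, \sin\theta\, \mathbf{e}_{y'} + \cos\theta\, \mathbf{e}_{z'}, \tag{8.81}$$

$$\mathbf{e}_{x''} = \cos\psi\, \mathbf{e}_{x'} - \sin\psi\, \mathbf{e}_{y'}. \tag{8.82}$$

It follows from Equation (8.81) that $\mathbf{e}_z \cdot \mathbf{e}_{z'} = \cos\theta$. In other words, $\theta$ is the angle of
inclination between the z- and z'-axes. Finally, since the total angular velocity can be
written

$$\boldsymbol{\omega} = \boldsymbol{\omega}_\phi + \boldsymbol{\omega}_\theta + \boldsymbol{\omega}_\psi, \tag{8.83}$$

Equations (8.76), (8.78), and (8.80)-(8.82) yield

$$\omega_{x'} = \sin\psi\, \sin\theta\, \dot{\phi} + \cos\psi\, \dot{\theta}, \tag{8.84}$$

$$\omega_{y'} = \cos\psi\, \sin\theta\, \dot{\phi} - \sin\psi\, \dot{\theta}, \tag{8.85}$$

$$\omega_{z'} = \cos\theta\, \dot{\phi} + \dot{\psi}. \tag{8.86}$$

The angles $\phi$, $\theta$, and $\psi$ are termed Eulerian angles. Each has a clear physical interpretation:
$\phi$ is the angle of precession about the $\mathbf{e}_z$ axis in the fixed frame, $\psi$ is minus the
angle of precession about the $\mathbf{e}_{z'}$ axis in the body frame, and $\theta$ is the angle of inclination
between the $\mathbf{e}_z$ and $\mathbf{e}_{z'}$ axes. Moreover, we can express the components of the angular
velocity vector $\boldsymbol{\omega}$ in the body frame entirely in terms of the Eulerian angles, and their time
derivatives [see Equations (8.84)-(8.86)].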
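
The three rotations and the resulting angular velocity components can be sketched in a few lines of Python. This is an illustration under stated assumptions: the matrices are the standard coordinate-rotation (passive) forms, chosen so that the composite transformation reproduces Equation (8.81), and all function names are hypothetical.

```python
import math

def rot_z(angle):
    """Coordinate rotation through `angle` about the current z-axis."""
    c, s = math.cos(angle), math.sin(angle)
    return [[c, s, 0.0], [-s, c, 0.0], [0.0, 0.0, 1.0]]

def rot_x(angle):
    """Coordinate rotation through `angle` about the current x-axis."""
    c, s = math.cos(angle), math.sin(angle)
    return [[1.0, 0.0, 0.0], [0.0, c, s], [0.0, -s, c]]

def apply(matrix, vec):
    """Matrix-vector product for 3x3 lists."""
    return [sum(matrix[i][k] * vec[k] for k in range(3)) for i in range(3)]

def fixed_to_body(vec, phi, theta, psi):
    """Components in the body frame of a vector given in the fixed frame."""
    return apply(rot_z(psi), apply(rot_x(theta), apply(rot_z(phi), vec)))

def body_angular_velocity(phi_dot, theta, theta_dot, psi, psi_dot):
    """Body-frame angular velocity, Equations (8.84)-(8.86)."""
    wx = math.sin(psi) * math.sin(theta) * phi_dot + math.cos(psi) * theta_dot
    wy = math.cos(psi) * math.sin(theta) * phi_dot - math.sin(psi) * theta_dot
    wz = math.cos(theta) * phi_dot + psi_dot
    return wx, wy, wz

# The fixed z-axis expressed in the body frame; compare with Equation (8.81).
print(fixed_to_body([0.0, 0.0, 1.0], 0.4, 0.7, 1.1))
```

Applying `fixed_to_body` to the fixed-frame unit vector along z returns $(\sin\psi \sin\theta,\ \cos\psi \sin\theta,\ \cos\theta)$, independent of $\phi$, as expected from Equation (8.81).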

Consider a rigid body which is constrained to rotate about a fixed axis with the constant
angular velocity $\omega$. Let the fixed angular velocity vector point along the z-axis. In the
previous section, we saw that the angular momentum and the torque were both steady in
the body frame. Since there is no precession of quantities in the body frame, it follows that
the Eulerian angle $\psi$ is constant. Furthermore, since the angular velocity vector is fixed in
the body frame, as well as the fixed frame [as can be seen by applying Equation (8.49) to
$\boldsymbol{\omega}$ instead of $\mathbf{L}$], it must subtend a constant angle with the $\mathbf{e}_{z'}$ axis. Hence, the Eulerian
angle $\theta$ is also constant. It follows from Equations (8.84)-(8.86) that

$$\omega_{x'} = \sin\psi\, \sin\theta\, \dot{\phi}, \tag{8.87}$$

$$\omega_{y'} = \cos\psi\, \sin\theta\, \dot{\phi}, \tag{8.88}$$

$$\omega_{z'} = \cos\theta\, \dot{\phi}, \tag{8.89}$$

which implies that $\omega = (\omega_{x'}^2 + \omega_{y'}^2 + \omega_{z'}^2)^{1/2} = \dot{\phi}$. In other words, the precession rate, $\dot{\phi}$, in
the fixed frame is equal to $\omega$. Hence, in the fixed frame, the constant torque and angular
momentum vectors found in the body frame precess about the angular velocity vector (i.e.,
about the z-axis) at the rate $\omega$. As discussed in the previous section, for the special case
where the angular velocity vector is parallel to one of the principal axes of the body, the
angular momentum vector is parallel to the angular velocity vector, and the torque is zero.
Thus, in this case, there is no precession in the fixed frame.

Consider a rotating device such as a flywheel or a propeller. If the device is statically 
balanced then its center of mass lies on the axis of rotation. This is desirable since, oth- 
erwise, gravity, which effectively acts at the center of mass, exerts a varying torque about 
the axis of rotation as the device rotates, giving rise to unsteady rotation. If the device is 
dynamically balanced then the axis of rotation is also a principal axis, so that, as the device
rotates, its angular momentum vector, $\mathbf{L}$, remains parallel to the axis of rotation. This is
desirable since, otherwise, the angular momentum vector is not parallel to the axis of rotation,
and, therefore, precesses around it. Since $d\mathbf{L}/dt$ is equal to the torque, a precessing
torque must also be applied to the device (at right-angles to both the axis and $\mathbf{L}$). The
result is a reaction on the bearings which can give rise to violent vibration and wobbling, 
even when the device is statically balanced. 

Consider a freely rotating body which is rotationally symmetric about one axis (the z'-axis).
In the absence of an external torque, the angular momentum vector $\mathbf{L}$ is a constant
of the motion [see Equation (8.3)]. Let $\mathbf{L}$ point along the z-axis. In the previous section,
we saw that the angular momentum vector subtends a constant angle $\theta$ with the axis of
symmetry: i.e., with the z'-axis. Hence, the time derivative of the Eulerian angle $\theta$ is zero.
We also saw that the angular momentum vector, the axis of symmetry, and the angular
velocity vector are coplanar. Consider an instant in time at which all of these vectors
lie in the y'-z' plane. This implies that $\omega_{x'} = 0$. According to the previous section, the
angular velocity vector subtends a constant angle $\alpha$ with the symmetry axis. It follows that
$\omega_{y'} = \omega \sin\alpha$ and $\omega_{z'} = \omega \cos\alpha$. Equation (8.84) yields $\psi = 0$. Hence, Equation (8.85)
yields

$$\omega \sin\alpha = \sin\theta\, \dot{\phi}. \tag{8.90}$$

This can be combined with Equation (8.72) to give

$$\dot{\phi} = \omega \left[1 + \left(\frac{I_\parallel^2}{I_\perp^2} - 1\right) \cos^2\alpha\right]^{1/2}. \tag{8.91}$$

Finally, Equation (8.86), together with (8.72) and (8.90), yields

$$\dot{\psi} = \omega \cos\alpha - \cos\theta\, \dot{\phi} = \omega \cos\alpha \left(1 - \frac{\tan\alpha}{\tan\theta}\right) = \omega \cos\alpha \left(1 - \frac{I_\parallel}{I_\perp}\right). \tag{8.92}$$

A comparison of the above equation with Equation (8.68) gives

$$\dot{\psi} = -\Omega. \tag{8.93}$$

Thus, as expected, $\dot{\psi}$ is minus the precession rate (of the angular momentum and angular
velocity vectors) in the body frame. On the other hand, $\dot{\phi}$ is the precession rate (of the
angular velocity vector and the symmetry axis) in the fixed frame. Note that $\dot{\phi}$ and $\Omega$ are
quite dissimilar. For instance, $\Omega$ is negative for elongated bodies ($I_\parallel < I_\perp$) whereas $\dot{\phi}$ is
positive definite. It follows that the precession is always in the same sense as $L_z$ in the
fixed frame, whereas the precession in the body frame is in the opposite sense to $L_{z'}$ for
elongated bodies. We found, in the previous section, that for a flattened body the angular
momentum vector lies between the angular velocity vector and the symmetry axis. This
means that, in the fixed frame, the angular velocity vector and the symmetry axis lie on
opposite sides of the fixed angular momentum vector. On the other hand, for an elongated
body we found that the angular velocity vector lies between the angular momentum vector
and the symmetry axis. This means that, in the fixed frame, the angular velocity vector
and the symmetry axis lie on the same side of the fixed angular momentum vector. (Recall
that the angular momentum vector, the angular velocity vector, and the symmetry axis are
coplanar.)

As an example, consider the free rotation of a thin disk. It is easily demonstrated (from
the perpendicular axis theorem) that

$$I_\parallel = 2\, I_\perp \tag{8.94}$$

for such a disk. Hence, from Equation (8.68), the precession rate in the body frame is

$$\Omega = \omega \cos\alpha. \tag{8.95}$$

According to Equation (8.91), the precession rate in the fixed frame is

$$\dot{\phi} = \omega \left[1 + 3 \cos^2\alpha\right]^{1/2}. \tag{8.96}$$

In the limit in which $\alpha$ is small (i.e., in which the angular velocity vector is almost parallel
to the symmetry axis), we obtain

$$\Omega \simeq \omega, \tag{8.97}$$

$$\dot{\phi} \simeq 2\, \omega. \tag{8.98}$$

Thus, the symmetry axis precesses in the fixed frame at approximately twice the angular
speed of rotation. This precession is manifest as a wobbling motion.
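
The fixed-frame rates (8.91) and (8.92) are simple to tabulate for any ratio $I_\parallel/I_\perp$. The sketch below assumes those two equations as written; the function name `fixed_frame_rates` and the sample angles are hypothetical.

```python
import math

def fixed_frame_rates(omega, alpha, ratio):
    """Return (phi_dot, psi_dot) for a free symmetric body.

    ratio is I_parallel / I_perp; alpha is the angle between the
    angular velocity vector and the symmetry axis.
    """
    c = math.cos(alpha)
    phi_dot = omega * math.sqrt(1.0 + (ratio**2 - 1.0) * c * c)  # Equation (8.91)
    psi_dot = omega * c * (1.0 - ratio)                          # Equation (8.92)
    return phi_dot, psi_dot

# Thin disk, ratio = 2 from Equation (8.94), for decreasing alpha.
for alpha in (0.5, 0.1, 0.01):
    phi_dot, psi_dot = fixed_frame_rates(1.0, alpha, 2.0)
    print(f"alpha = {alpha:4.2f}: phi_dot = {phi_dot:.5f}, psi_dot = {psi_dot:.5f}")
```

As $\alpha$ shrinks, the disk values approach $\dot{\phi} \simeq 2\omega$ and $\dot{\psi} \simeq -\omega$, consistent with Equations (8.97) and (8.98).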

It is known that the axis of rotation of the Earth is very slightly inclined to its symmetry
axis (which passes through the two geographic poles). The angle $\alpha$ is approximately 0.2
seconds of an arc (which corresponds to a distance of about 6 m on the Earth's surface). It
is also known that the ratio of the terrestrial moments of inertia is about $I_\parallel/I_\perp = 1.00327$,
as determined from the Earth's oblateness (see Section 12.7). Hence, from (8.68), the
precession rate of the angular velocity vector about the symmetry axis, as viewed in a
geostationary reference frame, is

$$\Omega = 0.00327\, \omega, \tag{8.99}$$

giving a precession period of

$$T' = \frac{2\pi}{\Omega} = 305\ \text{days}. \tag{8.100}$$

(Of course, $2\pi/\omega = 1$ day.) The observed period of precession is about 440 days. The
disagreement between theory and observation is attributed to the fact that the Earth is
not perfectly rigid. The Earth's symmetry axis subtends an angle $\theta \simeq \alpha = 0.2''$ [see
(8.72)] with its angular momentum vector, but lies on the opposite side of this vector to
the angular velocity vector. This implies that, as viewed from space, the Earth's angular
velocity vector is almost parallel to its fixed angular momentum vector, whereas its symmetry
axis subtends an angle of $0.2''$ with both vectors, and precesses about them. The
(theoretical) precession rate of the Earth's symmetry axis, as seen from space, is given by
Equation (8.91):

$$\dot{\phi} = 1.00327\, \omega. \tag{8.101}$$

The associated precession period is

$$T = \frac{2\pi}{\dot{\phi}} = 0.997\ \text{days}. \tag{8.102}$$

The free precession of the Earth's symmetry axis in space, which is known as the Chandler 
wobble, since it was discovered by the American astronomer Seth Chandler in 1891, is 
superimposed on a much slower forced precession, with a period of about 26,000 years, 
caused by the small gravitational torque exerted on the Earth by the Sun and the Moon, as 
a consequence of the Earth's slight oblateness — see Section 12.10. 
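
The arithmetic behind the figures quoted for the Earth can be reproduced as follows. This is a rough check only: the mean Earth radius of 6.371e6 m is an assumed value that does not appear in the text, and periods are expressed in units of $2\pi/\omega$, taken as one day.

```python
import math

RATIO = 1.00327            # I_parallel / I_perp, as quoted in the text
ALPHA_ARCSEC = 0.2         # inclination of the rotation axis, as quoted in the text
EARTH_RADIUS_M = 6.371e6   # assumed mean radius, not taken from the text

alpha = math.radians(ALPHA_ARCSEC / 3600.0)
cos_alpha = math.cos(alpha)

# Displacement on the surface subtended by alpha.
pole_offset_m = alpha * EARTH_RADIUS_M

# Equation (8.68): Omega / omega, hence the body-frame period in days.
body_period_days = 1.0 / ((RATIO - 1.0) * cos_alpha)

# Equation (8.91): phi_dot / omega, hence the fixed-frame period in days.
space_period_days = 1.0 / math.sqrt(1.0 + (RATIO**2 - 1.0) * cos_alpha**2)

print(f"pole offset       : {pole_offset_m:.2f} m")
print(f"body-frame period : {body_period_days:.1f} days")
print(f"space-frame period: {space_period_days:.4f} days")
```

Because $\alpha$ is so small, $\cos\alpha$ differs from unity by a negligible amount, which is why the text can drop it in Equations (8.99) and (8.101).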



8.9 Gyroscopic Precession 

Let us now study the motion of a rigid rotationally symmetric top which is free to turn
about a fixed point (without friction), but which is subject to a gravitational torque (see
Figure 8.3). Suppose that the z'-axis coincides with the symmetry axis. Let the principal
moment of inertia about the symmetry axis be $I_\parallel$, and let the other principal moments both
take the value $I_\perp$. Suppose that the z-axis points vertically upward, and let the common
origin, O, of the fixed and body frames coincide with the fixed point about which the top
turns. Suppose that the center of mass of the top lies a distance $l$ along its symmetry axis
from point O, and that the mass of the top is m. Let the symmetry axis of the top subtend
an angle $\theta$ (which is an Eulerian angle) with the upward vertical.

Consider an instant in time at which the Eulerian angle $\psi$ is zero. This implies that
the x'-axis is horizontal [see Equation (8.81)], as shown in the diagram. The gravitational
force, which acts at the center of mass, thus exerts a torque $m\, g\, l \sin\theta$ in the x'-direction.
Hence, the components of the torque in the body frame are

$$T_{x'} = m\, g\, l \sin\theta, \tag{8.103}$$

$$T_{y'} = 0, \tag{8.104}$$

$$T_{z'} = 0. \tag{8.105}$$

Figure 8.3: A symmetric top.

The components of the angular velocity vector in the body frame are given by Equations
(8.84)-(8.86). Thus, Euler's equations (8.51)-(8.53) take the form:

$$m\, g\, l \sin\theta = I_\perp \left(\ddot{\theta} - \cos\theta\, \sin\theta\, \dot{\phi}^2\right) + L_\psi \sin\theta\, \dot{\phi}, \tag{8.106}$$

$$0 = I_\perp \left(2 \cos\theta\, \dot{\theta}\, \dot{\phi} + \sin\theta\, \ddot{\phi}\right) - L_\psi\, \dot{\theta}, \tag{8.107}$$

$$0 = \dot{L}_\psi, \tag{8.108}$$

where

$$L_\psi = I_\parallel \left(\cos\theta\, \dot{\phi} + \dot{\psi}\right) = I_\parallel\, \Omega, \tag{8.109}$$

and $\Omega = \omega_{z'}$ is the angular velocity of the top. Multiplying Equation (8.107) by $\sin\theta$, we
obtain

$$\dot{L}_\phi = 0, \tag{8.110}$$

where

$$L_\phi = I_\perp \sin^2\theta\, \dot{\phi} + L_\psi \cos\theta. \tag{8.111}$$

According to Equations (8.108) and (8.110), the two quantities $L_\psi$ and $L_\phi$ are constants
of the motion. These two quantities are the angular momenta of the system about the z'-
and z-axis, respectively. (To be more exact, they are the generalized momenta conjugate
to the coordinates $\psi$ and $\phi$, respectively; see Section 9.8.) They are conserved because
the gravitational torque has no component along either the z'- or the z-axis. (Alternatively,
they are conserved because the Lagrangian of the top does not depend explicitly on the
coordinates $\psi$ and $\phi$; see Section 9.8.)

If there are no frictional forces acting on the top then the total energy, E = K + U, is
also a constant of the motion. Now,

$$E = \frac{1}{2} \left(I_\perp\, \omega_{x'}^2 + I_\perp\, \omega_{y'}^2 + I_\parallel\, \omega_{z'}^2\right) + m\, g\, l \cos\theta. \tag{8.112}$$

When written in terms of the Eulerian angles (with $\psi = 0$), this becomes

$$E = \frac{1}{2} \left(I_\perp\, \dot{\theta}^2 + I_\perp \sin^2\theta\, \dot{\phi}^2 + L_\psi^2 / I_\parallel\right) + m\, g\, l \cos\theta. \tag{8.113}$$

Eliminating $\dot{\phi}$ between Equations (8.111) and (8.113), we obtain the following differential
equation for $\theta$:

$$E = \frac{1}{2}\, I_\perp\, \dot{\theta}^2 + \frac{(L_\phi - L_\psi \cos\theta)^2}{2\, I_\perp \sin^2\theta} + \frac{1}{2} \frac{L_\psi^2}{I_\parallel} + m\, g\, l \cos\theta. \tag{8.114}$$

Let

$$E' = E - \frac{1}{2} \frac{L_\psi^2}{I_\parallel}, \tag{8.115}$$

and $u = \cos\theta$. It follows that

$$\dot{u}^2 = 2\, (E' - m\, g\, l\, u)\, (1 - u^2)\, I_\perp^{-1} - (L_\phi - L_\psi\, u)^2\, I_\perp^{-2}, \tag{8.116}$$

or

$$\dot{u}^2 = f(u), \tag{8.117}$$

where f(u) is a cubic polynomial. In principle, the above equation can be integrated to
give u (and, hence, $\theta$) as a function of t:

$$t = \int^{u} \frac{du'}{\sqrt{f(u')}}. \tag{8.118}$$



Fortunately, we do not have to perform the above integration (which is very ugly) in
order to discuss the general properties of the solution to Equation (8.117). It is clear, from
Equation (8.118), that f(u) needs to be positive in order to obtain a physical solution.
Hence, the limits of the motion in $\theta$ are determined by the three roots of the equation
f(u) = 0. Since $\theta$ must lie between 0 and $\pi/2$, it follows that u must lie between 0 and
1. It can easily be demonstrated that $f \to \pm\infty$ as $u \to \pm\infty$. It can also be shown that
the largest root $u_3$ lies in the region $u_3 > 1$, and the two smaller roots $u_1$ and $u_2$ (if they
exist) lie in the region $-1 \le u \le +1$. It follows that, in the region $-1 \le u \le 1$, f(u) is
only positive between $u_1$ and $u_2$. Figure 8.4 shows a case where $u_1$ and $u_2$ lie in the range
0 to 1. The corresponding values of $\theta$, say $\theta_1$ and $\theta_2$, are then the limits of the vertical
motion. The axis of the top oscillates backward and forward between these two values of $\theta$
as the top precesses about the vertical axis. This oscillation is called nutation. Incidentally,
if $u_1$ becomes negative then the nutation will cause the top to strike the ground (assuming
that it is spinning on a level surface).
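
The nutation limits can be located numerically without evaluating the integral in Equation (8.118). The sketch below scans f(u) from Equation (8.116) for sign changes on $-1 \le u \le 1$ and refines each one by bisection. The parameter values describe a hypothetical top released with $\dot{\theta} = \dot{\phi} = 0$ at $\cos\theta = u_0$, for which Equations (8.111), (8.113), and (8.115) give $L_\phi = L_\psi u_0$ and $E' = m g l\, u_0$; the function names are illustrative, and a tangential double root (steady precession) would not be detected by this sign-change search.

```python
import math

def make_f(e_prime, l_phi, l_psi, mgl, i_perp):
    """Return the cubic f(u) of Equation (8.116) as a Python function."""
    def f(u):
        return (2.0 * (e_prime - mgl * u) * (1.0 - u * u) / i_perp
                - (l_phi - l_psi * u) ** 2 / i_perp ** 2)
    return f

def roots_between(f, lo, hi, n=2000):
    """Roots of f on [lo, hi] found by a grid scan followed by bisection."""
    roots = []
    grid = [lo + (hi - lo) * k / n for k in range(n + 1)]
    for a, b in zip(grid, grid[1:]):
        fa, fb = f(a), f(b)
        if fa == 0.0:
            roots.append(a)              # root falls exactly on a grid point
        elif fa * fb < 0.0:
            for _ in range(60):          # bisection on the bracketing interval
                mid = 0.5 * (a + b)
                fm = f(mid)
                if fa * fm <= 0.0:
                    b = mid
                else:
                    a, fa = mid, fm
            roots.append(0.5 * (a + b))
    if f(grid[-1]) == 0.0:
        roots.append(grid[-1])
    return roots

# Hypothetical parameters (arbitrary units).
mgl, i_perp, l_psi, u0 = 1.0, 1.0, 4.0, 0.5
f = make_f(e_prime=mgl * u0, l_phi=l_psi * u0, l_psi=l_psi, mgl=mgl, i_perp=i_perp)
u1, u2 = roots_between(f, -1.0, 1.0)
print(f"u1 = {u1:.5f}, u2 = {u2:.5f}")
print(f"theta ranges from {math.degrees(math.acos(u2)):.2f} to "
      f"{math.degrees(math.acos(u1)):.2f} degrees")
```

For these values the top starts at the upper limit $u_2 = u_0$ and nutates down to $u_1$, with f(u) positive in between.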

If there is a double root of f(u) = 0 (i.e., if $u_1 = u_2$) then there is no nutation, and the
top precesses steadily. However, the criterion for steady precession is most easily obtained
directly from Equation (8.106). In the absence of nutation, $\dot{\theta} = \ddot{\theta} = 0$. Hence, we obtain

$$m\, g\, l = -I_\perp \cos\theta\, \dot{\phi}^2 + L_\psi\, \dot{\phi}, \tag{8.119}$$

or

$$\Omega = \frac{m\, g\, l}{I_\parallel\, \dot{\phi}} + \frac{I_\perp}{I_\parallel} \cos\theta\, \dot{\phi}. \tag{8.120}$$

Figure 8.4: The function f(u).

The above equation is the criterion for steady precession. Since the right-hand side of
Equation (8.120) possesses a minimum value, which is given by $2 \sqrt{m\, g\, l\, I_\perp \cos\theta} / I_\parallel$, it
follows that

$$\Omega > \Omega_{\rm crit} = \frac{2 \sqrt{m\, g\, l\, I_\perp \cos\theta}}{I_\parallel} \tag{8.121}$$

is a necessary condition for obtaining steady precession at the inclination angle $\theta$. For
$\Omega > \Omega_{\rm crit}$, there are two roots to Equation (8.120), corresponding to a slow and a fast
steady precession rate for a given inclination angle $\theta$. If $\Omega \gg \Omega_{\rm crit}$ then these two roots
are approximately given by

$$(\dot{\phi})_{\rm slow} \simeq \frac{m\, g\, l}{I_\parallel\, \Omega}, \tag{8.122}$$

$$(\dot{\phi})_{\rm fast} \simeq \frac{I_\parallel\, \Omega}{I_\perp \cos\theta}. \tag{8.123}$$

The slower of these two precession rates is the one which is generally observed. 
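
Equation (8.120) is a quadratic in $\dot{\phi}$, so the two steady precession rates can be computed exactly and compared with their large-spin limits. The following is a sketch with hypothetical parameter values; it assumes $0 \le \theta < \pi/2$ so that $\cos\theta > 0$, and the function name `steady_precession_rates` is illustrative.

```python
import math

def steady_precession_rates(spin, theta, mgl, i_par, i_perp):
    """Slow and fast steady precession rates, or None below the critical spin.

    Solves Equation (8.120) rearranged as
        i_perp*cos(theta)*phi_dot**2 - i_par*spin*phi_dot + mgl = 0.
    """
    c = math.cos(theta)
    spin_crit = 2.0 * math.sqrt(mgl * i_perp * c) / i_par   # Equation (8.121)
    if spin < spin_crit:
        return None
    b = i_par * spin
    root = math.sqrt(max(0.0, b * b - 4.0 * i_perp * c * mgl))
    slow = 2.0 * mgl / (b + root)          # form that avoids cancellation
    fast = (b + root) / (2.0 * i_perp * c)
    return slow, fast

# Hypothetical top (arbitrary units), spinning far above the critical rate.
slow, fast = steady_precession_rates(spin=100.0, theta=math.pi / 3,
                                     mgl=1.0, i_par=0.5, i_perp=1.0)
print(f"slow = {slow:.6f}  (large-spin estimate {1.0 / (0.5 * 100.0):.6f})")
print(f"fast = {fast:.4f}  (large-spin estimate {0.5 * 100.0 / (1.0 * 0.5):.4f})")
```

The slow root is written as $2 m g l / (I_\parallel \Omega + \sqrt{\cdots})$ rather than with a subtraction, a common choice that keeps it accurate when the spin is large.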



8.10 Rotational Stability 

Consider a rigid body for which all of the principal moments of inertia are distinct. Let 
$I_{z'z'} > I_{y'y'} > I_{x'x'}$. Suppose that the body is rotating freely about one of its principal axes.
What happens when the body is slightly disturbed?

Let the body be initially rotating about the x'-axis, so that

$$\boldsymbol{\omega} = \omega_{x'}\, \mathbf{e}_{x'}. \tag{8.124}$$

If we apply a slight perturbation then the angular velocity becomes

$$\boldsymbol{\omega} = \omega_{x'}\, \mathbf{e}_{x'} + \lambda\, \mathbf{e}_{y'} + \mu\, \mathbf{e}_{z'}, \tag{8.125}$$

where $\lambda$ and $\mu$ are both assumed to be small. Euler's equations (8.51)-(8.53) take the
form

$$I_{x'x'}\, \dot{\omega}_{x'} - (I_{y'y'} - I_{z'z'})\, \lambda\, \mu = 0, \tag{8.126}$$

$$I_{y'y'}\, \dot{\lambda} - (I_{z'z'} - I_{x'x'})\, \omega_{x'}\, \mu = 0, \tag{8.127}$$

$$I_{z'z'}\, \dot{\mu} - (I_{x'x'} - I_{y'y'})\, \omega_{x'}\, \lambda = 0. \tag{8.128}$$



Since $\lambda\, \mu$ is second-order in small quantities (and, therefore, negligible) the first of the
above equations tells us that $\omega_{x'}$ is an approximate constant of the motion. The other two
equations can be written

$$\dot{\lambda} = \left[\frac{(I_{z'z'} - I_{x'x'})\, \omega_{x'}}{I_{y'y'}}\right] \mu, \tag{8.129}$$

$$\dot{\mu} = -\left[\frac{(I_{y'y'} - I_{x'x'})\, \omega_{x'}}{I_{z'z'}}\right] \lambda. \tag{8.130}$$

Differentiating the first equation with respect to time, and then eliminating $\dot{\mu}$, we obtain

$$\ddot{\lambda} + \left[\frac{(I_{y'y'} - I_{x'x'})\, (I_{z'z'} - I_{x'x'})}{I_{y'y'}\, I_{z'z'}}\right] \omega_{x'}^2\, \lambda = 0. \tag{8.131}$$

It is easily demonstrated that $\mu$ satisfies the same differential equation. Since the term in
square brackets in the above equation is positive, the equation takes the form of a simple
harmonic equation, and, thus, has the bounded solution:

$$\lambda = \lambda_0 \cos(\Omega_{x'}\, t' - \alpha). \tag{8.132}$$

Here, $\lambda_0$ and $\alpha$ are constants of integration, and

$$\Omega_{x'} = \left[\frac{(I_{y'y'} - I_{x'x'})\, (I_{z'z'} - I_{x'x'})}{I_{y'y'}\, I_{z'z'}}\right]^{1/2} \omega_{x'}. \tag{8.133}$$



Thus, the body oscillates sinusoidally about its initial state with the angular frequency $\Omega_{x'}$.
It follows that the body is stable to small perturbations when rotating about the x'-axis, in
the sense that the amplitude of such perturbations does not grow in time.

Suppose that the body is initially rotating about the z'-axis, and is subject to a small perturbation.
A similar argument to the above allows us to conclude that the body oscillates
sinusoidally about its initial state with angular frequency

$$\Omega_{z'} = \left[\frac{(I_{z'z'} - I_{x'x'})\, (I_{z'z'} - I_{y'y'})}{I_{x'x'}\, I_{y'y'}}\right]^{1/2} \omega_{z'}. \tag{8.134}$$



Hence, the body is also stable to small perturbations when rotating about the z'-axis. 

Suppose, finally, that the body is initially rotating about the y'-axis, and is subject to a
small perturbation, such that

$$\boldsymbol{\omega} = \lambda\, \mathbf{e}_{x'} + \omega_{y'}\, \mathbf{e}_{y'} + \mu\, \mathbf{e}_{z'}. \tag{8.135}$$

It is easily demonstrated that $\lambda$ satisfies the following differential equation:

$$\ddot{\lambda} - \left[\frac{(I_{y'y'} - I_{x'x'})\, (I_{z'z'} - I_{y'y'})}{I_{x'x'}\, I_{z'z'}}\right] \omega_{y'}^2\, \lambda = 0. \tag{8.136}$$

Note that the term in square brackets is positive. Hence, the above equation is not the
simple harmonic equation. Indeed, its solution takes the form

$$\lambda = A\, e^{k t'} + B\, e^{-k t'}. \tag{8.137}$$

Here, A and B are constants of integration, and

$$k = \left[\frac{(I_{y'y'} - I_{x'x'})\, (I_{z'z'} - I_{y'y'})}{I_{x'x'}\, I_{z'z'}}\right]^{1/2} \omega_{y'}. \tag{8.138}$$

In this case, the amplitude of the perturbation grows exponentially in time. Hence, the
body is unstable to small perturbations when rotating about the y'-axis.

In conclusion, a rigid body with three distinct principal moments of inertia is stable to 
small perturbations when rotating about the principal axes with the largest and smallest 
moments, but is unstable when rotating about the axis with the intermediate moment. 

Finally, if two of the principal moments are the same then it can be shown that the body 
is only stable to small perturbations when rotating about the principal axis whose moment 
is distinct from the other two. 
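
The stability results can be illustrated by integrating the torque-free Euler equations numerically. The sketch below uses a classical fourth-order Runge-Kutta step, which is a choice made for this example and not a method from the text. The moments (1, 2, 3), the perturbation size, and the integration time are hypothetical values, and the function names are illustrative.

```python
import math

def free_rates(moments, w):
    """d(omega)/dt' from Euler's equations (8.51)-(8.53) with zero torque."""
    ix, iy, iz = moments
    return [(iy - iz) * w[1] * w[2] / ix,
            (iz - ix) * w[2] * w[0] / iy,
            (ix - iy) * w[0] * w[1] / iz]

def rk4_step(moments, w, dt):
    """Advance omega by one classical Runge-Kutta step of size dt."""
    k1 = free_rates(moments, w)
    k2 = free_rates(moments, [wi + 0.5 * dt * k for wi, k in zip(w, k1)])
    k3 = free_rates(moments, [wi + 0.5 * dt * k for wi, k in zip(w, k2)])
    k4 = free_rates(moments, [wi + dt * k for wi, k in zip(w, k3)])
    return [wi + dt * (a + 2.0 * b + 2.0 * c + d) / 6.0
            for wi, a, b, c, d in zip(w, k1, k2, k3, k4)]

def max_transverse(moments, axis, eps=1e-4, t_end=20.0, dt=0.01):
    """Largest angular velocity component perpendicular to the chosen axis.

    The body starts with unit angular velocity about principal axis
    `axis` (0, 1, or 2) plus perturbations of size eps on the other two.
    """
    w = [eps, eps, eps]
    w[axis] = 1.0
    worst = 0.0
    for _ in range(round(t_end / dt)):
        w = rk4_step(moments, w, dt)
        transverse = math.sqrt(sum(w[k] ** 2 for k in range(3) if k != axis))
        worst = max(worst, transverse)
    return worst

moments = (1.0, 2.0, 3.0)   # I_x'x' < I_y'y' < I_z'z'
for axis, name in enumerate(("x'", "y'", "z'")):
    print(f"rotation about {name}: max transverse component = "
          f"{max_transverse(moments, axis):.2e}")
```

With these values the perturbation stays comparable to its initial size for rotation about the x'- and z'-axes, but grows by orders of magnitude for rotation about the intermediate y'-axis.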



8.11 Exercises 

8.1. Find the principal axes of rotation and the principal moments of inertia for a thin uniform 
rectangular plate of mass m and dimensions 2 a by a for rotation about axes passing through 
(a) the center of mass, and (b) a corner.

8.2. A rigid body having an axis of symmetry rotates freely about a fixed point under no torques.
If $\alpha$ is the angle between the symmetry axis and the instantaneous axis of rotation, show that
the angle between the axis of rotation and the invariable line (the $\mathbf{L}$ vector) is

$$\tan^{-1}\!\left[\frac{(I_\parallel - I_\perp) \tan\alpha}{I_\parallel + I_\perp \tan^2\alpha}\right]$$

where $I_\parallel$ (the moment of inertia about the symmetry axis) is greater than $I_\perp$ (the moment of
inertia about an axis normal to the symmetry axis).

8.3. Since the greatest value of $I_\parallel/I_\perp$ is 2 (symmetrical lamina) show from the previous result
that the angle between the angular velocity and angular momentum vectors cannot exceed
$\tan^{-1}(1/\sqrt{8}) \simeq 19.5^\circ$. Find the corresponding value of $\alpha$.

8.4. A thin uniform rod of length $l$ and mass m is constrained to rotate with constant angular
velocity $\omega$ about an axis passing through the center of the rod, and making an angle $\alpha$ with
the rod. Show that the angular momentum about the center of the rod is perpendicular to
the rod, and is of magnitude $(m\, l^2\, \omega/12) \sin\alpha$. Show that the torque is perpendicular to both
the rod and the angular momentum vector, and is of magnitude $(m\, l^2\, \omega^2/12) \sin\alpha \cos\alpha$.

8.5. A thin uniform disk of radius a and mass m is constrained to rotate with constant angular
velocity $\omega$ about an axis passing through its center, and making an angle $\alpha$ with the normal
to the disk. Find the angular momentum about the center of the disk, as well as the torque 
acting on the disk. 

8.6. Demonstrate that for an isolated rigid body which possesses an axis of symmetry, and rotates 
about one of its principal axes, the motion is only stable to small perturbations if the principal 
axis is that which corresponds to the symmetry axis.



9 Lagrangian Dynamics



9.1 Introduction

This chapter describes an elegant reformulation of the laws of Newtonian dynamics which 
is due to the French/Italian mathematician Joseph Louis Lagrange. This reformulation is 
particularly useful for finding the equations of motion of complicated dynamical systems. 

9.2 Generalized Coordinates 

Let the q i} for i = 1 , T , be a set of coordinates which uniquely specifies the instantaneous 
configuration of some dynamical system. Here, it is assumed that each of the q^ can vary 
independently. The q^ might be Cartesian coordinates, or polar coordinates, or angles, 
or some mixture of all three types of coordinate, and are, therefore, termed generalized 
coordinates. A dynamical system whose instantaneous configuration is fully specified by 
T independent generalized coordinates is said to have T degrees of freedom. For instance, 
the instantaneous position of a particle moving freely in three dimensions is completely 
specified by its three Cartesian coordinates, x, y, and z. Moreover, these coordinates are 
clearly independent of one another. Hence, a dynamical system consisting of a single 
particle moving freely in three dimensions has three degrees of freedom. If there are two 
freely moving particles then the system has six degrees of freedom, and so on. 

Suppose that we have a dynamical system consisting of $N$ particles moving freely in
three dimensions. This is an $\mathcal{F} = 3N$ degree of freedom system whose instantaneous
configuration can be specified by $\mathcal{F}$ Cartesian coordinates. Let us denote these coordinates the
$x_j$, for $j = 1, \mathcal{F}$. Thus, $x_1, x_2, x_3$ are the Cartesian coordinates of the first particle, $x_4, x_5, x_6$
the Cartesian coordinates of the second particle, etc. Suppose that the instantaneous
configuration of the system can also be specified by $\mathcal{F}$ generalized coordinates, which we shall
denote the $q_i$, for $i = 1, \mathcal{F}$. Thus, the $q_i$ might be the spherical coordinates of the particles.
In general, we expect the $x_j$ to be functions of the $q_i$. In other words,

$$x_j = x_j(q_1, q_2, \cdots, q_{\mathcal{F}}, t) \tag{9.1}$$

for $j = 1, \mathcal{F}$. Here, for the sake of generality, we have included the possibility that the
functional relationship between the $x_j$ and the $q_i$ might depend on the time, $t$, explicitly.
This would be the case if the dynamical system were subject to time varying constraints:
for instance, a system consisting of a particle constrained to move on a surface which is
itself moving. Finally, by the chain rule, the variation of the $x_j$ due to a variation of the $q_i$
(at constant $t$) is given by

$$\delta x_j = \sum_{i=1,\mathcal{F}} \frac{\partial x_j}{\partial q_i}\,\delta q_i, \tag{9.2}$$

for $j = 1, \mathcal{F}$.
9.3 Generalized Forces

The work done on the dynamical system when its Cartesian coordinates change by $\delta x_j$ is
simply

$$\delta W = \sum_{j=1,\mathcal{F}} f_j\,\delta x_j. \tag{9.3}$$

Here, the $f_j$ are the Cartesian components of the forces acting on the various particles
making up the system. Thus, $f_1, f_2, f_3$ are the components of the force acting on the first
particle, $f_4, f_5, f_6$ the components of the force acting on the second particle, etc. Using
Equation (9.2), we can also write

$$\delta W = \sum_{j=1,\mathcal{F}} f_j \sum_{i=1,\mathcal{F}} \frac{\partial x_j}{\partial q_i}\,\delta q_i. \tag{9.4}$$

The above expression can be rearranged to give

$$\delta W = \sum_{i=1,\mathcal{F}} Q_i\,\delta q_i, \tag{9.5}$$

where

$$Q_i = \sum_{j=1,\mathcal{F}} f_j\,\frac{\partial x_j}{\partial q_i}. \tag{9.6}$$

Here, the $Q_i$ are termed generalized forces. Note that a generalized force does not
necessarily have the dimensions of force. However, the product $Q_i\,q_i$ must have the dimensions
of work. Thus, if a particular $q_i$ is a Cartesian coordinate then the associated $Q_i$ is a force.
Conversely, if a particular $q_i$ is an angle then the associated $Q_i$ is a torque.
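As a numerical illustration of Equation (9.6), the following minimal Python sketch evaluates the generalized forces for a single particle in a plane described by polar coordinates. The function names, the sample force, and the use of central finite differences for the partial derivatives are assumptions made purely for illustration. For this choice of coordinates the generalized force conjugate to the angle should coincide with the torque about the origin, $x\,f_y - y\,f_x$.

```python
import math

def generalized_forces(x_of_q, q, f, eps=1e-6):
    """Q_i = sum_j f_j * dx_j/dq_i, with dx_j/dq_i from central differences."""
    Q = []
    for i in range(len(q)):
        q_plus = list(q)
        q_minus = list(q)
        q_plus[i] += eps
        q_minus[i] -= eps
        x_plus = x_of_q(q_plus)
        x_minus = x_of_q(q_minus)
        Q.append(sum(fj * (xp - xm) / (2.0 * eps)
                     for fj, xp, xm in zip(f, x_plus, x_minus)))
    return Q

def polar_to_cartesian(q):
    """Cartesian coordinates (x, y) in terms of q = (r, theta)."""
    r, theta = q
    return [r * math.cos(theta), r * math.sin(theta)]

# Illustrative values (hypothetical): position and Cartesian force components.
r, theta = 2.0, 0.3
fx, fy = 1.5, -0.7

Q_r, Q_theta = generalized_forces(polar_to_cartesian, [r, theta], [fx, fy])

x, y = polar_to_cartesian([r, theta])
torque = x * fy - y * fx                               # torque about the origin
radial_force = fx * math.cos(theta) + fy * math.sin(theta)
```

Here `Q_r` reproduces the radial component of the force, whereas `Q_theta` reproduces the torque, in line with the remark that a generalized force need not have the dimensions of force.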

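Suppose that the dynamical system in question is conservative. It follows that

$$f_j = -\frac{\partial U}{\partial x_j}, \tag{9.7}$$

for $j = 1, \mathcal{F}$, where $U(x_1, x_2, \cdots, x_{\mathcal{F}}, t)$ is the system's potential energy. Hence, according
to Equation (9.6),

$$Q_i = -\sum_{j=1,\mathcal{F}} \frac{\partial U}{\partial x_j}\,\frac{\partial x_j}{\partial q_i} = -\frac{\partial U}{\partial q_i} \tag{9.8}$$

for $i = 1, \mathcal{F}$.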

9.4 Lagrange's Equation 

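The Cartesian equations of motion of our system take the form

$$m_j\,\ddot x_j = f_j, \tag{9.9}$$

for $j = 1, \mathcal{F}$, where $m_1, m_2, m_3$ are each equal to the mass of the first particle, $m_4, m_5, m_6$
are each equal to the mass of the second particle, etc. Furthermore, the kinetic energy of
the system can be written

$$K = \frac{1}{2} \sum_{j=1,\mathcal{F}} m_j\,\dot x_j^{\,2}. \tag{9.10}$$

Now, since $x_j = x_j(q_1, q_2, \cdots, q_{\mathcal{F}}, t)$, we can write

$$\dot x_j = \sum_{i=1,\mathcal{F}} \frac{\partial x_j}{\partial q_i}\,\dot q_i + \frac{\partial x_j}{\partial t}, \tag{9.11}$$

for $j = 1, \mathcal{F}$. Hence, it follows that $\dot x_j = \dot x_j(q_1, q_2, \cdots, q_{\mathcal{F}}, \dot q_1, \dot q_2, \cdots, \dot q_{\mathcal{F}}, t)$. According to
the above equation,

$$\frac{\partial \dot x_j}{\partial \dot q_i} = \frac{\partial x_j}{\partial q_i}, \tag{9.12}$$

where we are treating the $\dot q_i$ and the $q_i$ as independent variables.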

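Multiplying Equation (9.12) by $\dot x_j$, and then differentiating with respect to time, we
obtain

$$\frac{d}{dt}\!\left(\dot x_j\,\frac{\partial \dot x_j}{\partial \dot q_i}\right) = \frac{d}{dt}\!\left(\dot x_j\,\frac{\partial x_j}{\partial q_i}\right) = \ddot x_j\,\frac{\partial x_j}{\partial q_i} + \dot x_j\,\frac{d}{dt}\!\left(\frac{\partial x_j}{\partial q_i}\right). \tag{9.13}$$

Now,

$$\frac{d}{dt}\!\left(\frac{\partial x_j}{\partial q_i}\right) = \sum_{k=1,\mathcal{F}} \frac{\partial^2 x_j}{\partial q_i\,\partial q_k}\,\dot q_k + \frac{\partial^2 x_j}{\partial q_i\,\partial t}. \tag{9.14}$$

Furthermore,

$$\frac{1}{2}\,\frac{\partial \dot x_j^{\,2}}{\partial \dot q_i} = \dot x_j\,\frac{\partial \dot x_j}{\partial \dot q_i}, \tag{9.15}$$

and

$$\frac{1}{2}\,\frac{\partial \dot x_j^{\,2}}{\partial q_i} = \dot x_j\,\frac{\partial \dot x_j}{\partial q_i} = \dot x_j\,\frac{\partial}{\partial q_i}\!\left(\sum_{k=1,\mathcal{F}} \frac{\partial x_j}{\partial q_k}\,\dot q_k + \frac{\partial x_j}{\partial t}\right) = \dot x_j \left(\sum_{k=1,\mathcal{F}} \frac{\partial^2 x_j}{\partial q_i\,\partial q_k}\,\dot q_k + \frac{\partial^2 x_j}{\partial q_i\,\partial t}\right) = \dot x_j\,\frac{d}{dt}\!\left(\frac{\partial x_j}{\partial q_i}\right), \tag{9.16}$$

where use has been made of Equation (9.14). Thus, it follows from Equations (9.13),
(9.15), and (9.16) that

$$\frac{d}{dt}\!\left(\frac{1}{2}\,\frac{\partial \dot x_j^{\,2}}{\partial \dot q_i}\right) = \ddot x_j\,\frac{\partial x_j}{\partial q_i} + \frac{1}{2}\,\frac{\partial \dot x_j^{\,2}}{\partial q_i}. \tag{9.17}$$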
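Let us take the above equation, multiply by $m_j$, and then sum over all $j$. We obtain

$$\frac{d}{dt}\!\left(\frac{\partial K}{\partial \dot q_i}\right) = \sum_{j=1,\mathcal{F}} f_j\,\frac{\partial x_j}{\partial q_i} + \frac{\partial K}{\partial q_i}, \tag{9.18}$$

where use has been made of Equations (9.9) and (9.10). Thus, it follows from
Equation (9.6) that

$$\frac{d}{dt}\!\left(\frac{\partial K}{\partial \dot q_i}\right) = Q_i + \frac{\partial K}{\partial q_i}. \tag{9.19}$$

Finally, making use of Equation (9.8), we get

$$\frac{d}{dt}\!\left(\frac{\partial K}{\partial \dot q_i}\right) = -\frac{\partial U}{\partial q_i} + \frac{\partial K}{\partial q_i}. \tag{9.20}$$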



It is helpful to introduce a function $L$, called the Lagrangian, which is defined as the
difference between the kinetic and potential energies of the dynamical system under
investigation:

$$L = K - U. \tag{9.21}$$

Since the potential energy $U$ is clearly independent of the $\dot q_i$, it follows from Equation (9.20)
that

$$\frac{d}{dt}\!\left(\frac{\partial L}{\partial \dot q_i}\right) - \frac{\partial L}{\partial q_i} = 0, \tag{9.22}$$

for $i = 1, \mathcal{F}$. This equation is known as Lagrange's equation.

According to the above analysis, if we can express the kinetic and potential energies 
of our dynamical system solely in terms of our generalized coordinates and their time 
derivatives then we can immediately write down the equations of motion of the system, 
expressed in terms of the generalized coordinates, using Lagrange's equation, (9.22). Un- 
fortunately, this scheme only works for conservative systems. Let us now consider some 
examples. 



9.5 Motion in a Central Potential 

Consider a particle of mass $m$ moving in two dimensions in the central potential $U(r)$.
This is clearly a two degree of freedom dynamical system. As described in Section 5.5,
the particle's instantaneous position is most conveniently specified in terms of the plane
polar coordinates $r$ and $\theta$. These are our two generalized coordinates. According to
Equation (5.14), the square of the particle's velocity can be written

$$v^2 = \dot r^2 + (r\,\dot\theta)^2. \tag{9.23}$$

Hence, the Lagrangian of the system takes the form

$$L = \frac{1}{2}\,m\,(\dot r^2 + r^2\,\dot\theta^2) - U(r). \tag{9.24}$$
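Note that

$$\frac{\partial L}{\partial \dot r} = m\,\dot r, \qquad \frac{\partial L}{\partial r} = m\,r\,\dot\theta^2 - \frac{dU}{dr}, \tag{9.25}$$

$$\frac{\partial L}{\partial \dot\theta} = m\,r^2\,\dot\theta, \qquad \frac{\partial L}{\partial \theta} = 0. \tag{9.26}$$

Now, Lagrange's equation (9.22) yields the equations of motion,

$$\frac{d}{dt}\!\left(\frac{\partial L}{\partial \dot r}\right) - \frac{\partial L}{\partial r} = 0, \tag{9.27}$$

$$\frac{d}{dt}\!\left(\frac{\partial L}{\partial \dot\theta}\right) - \frac{\partial L}{\partial \theta} = 0. \tag{9.28}$$

Hence, we obtain

$$\frac{d}{dt}(m\,\dot r) - m\,r\,\dot\theta^2 + \frac{dU}{dr} = 0, \tag{9.29}$$

$$\frac{d}{dt}(m\,r^2\,\dot\theta) = 0, \tag{9.30}$$

or

$$\ddot r - r\,\dot\theta^2 = -\frac{dV}{dr}, \tag{9.31}$$

$$r^2\,\dot\theta = h, \tag{9.32}$$

where $V = U/m$, and $h$ is a constant. We recognize Equations (9.31) and (9.32) as the
equations we derived in Chapter 5 for motion in a central potential. The advantage of
the Lagrangian method of deriving these equations is that we avoid having to express the
acceleration in terms of the generalized coordinates $r$ and $\theta$.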
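Equations (9.31) and (9.32) can be explored numerically. The sketch below is only illustrative: it assumes the particular potential $V(r) = -1/r$ in dimensionless units, uses a hand-written fourth-order Runge-Kutta step, and all function names are hypothetical. It eliminates $\dot\theta$ via $\dot\theta = h/r^2$, integrates the radial equation, and monitors the energy per unit mass, $\dot r^2/2 + h^2/(2\,r^2) + V(r)$, which should remain constant along the computed orbit.

```python
def radial_acceleration(r, h):
    # Equation (9.31) with theta_dot = h / r**2 and the assumed V(r) = -1/r,
    # so that dV/dr = 1 / r**2.
    return h * h / r**3 - 1.0 / r**2

def derivatives(state, h):
    r, r_dot, theta = state
    return (r_dot, radial_acceleration(r, h), h / r**2)

def integrate(state, h, dt, steps):
    """Advance (r, r_dot, theta) with classical fourth-order Runge-Kutta steps."""
    for _ in range(steps):
        k1 = derivatives(state, h)
        k2 = derivatives(tuple(s + 0.5 * dt * k for s, k in zip(state, k1)), h)
        k3 = derivatives(tuple(s + 0.5 * dt * k for s, k in zip(state, k2)), h)
        k4 = derivatives(tuple(s + dt * k for s, k in zip(state, k3)), h)
        state = tuple(s + dt * (a + 2.0 * b + 2.0 * c + d) / 6.0
                      for s, a, b, c, d in zip(state, k1, k2, k3, k4))
    return state

def energy(r, r_dot, h):
    # Energy per unit mass for the assumed potential V(r) = -1/r.
    return 0.5 * r_dot**2 + 0.5 * h * h / r**2 - 1.0 / r

# Illustrative initial conditions (hypothetical values): a bound, non-circular orbit.
h = 1.2
initial = (1.0, 0.0, 0.0)            # r, r_dot, theta
final = integrate(initial, h, dt=1.0e-3, steps=5000)

E_initial = energy(initial[0], initial[1], h)
E_final = energy(final[0], final[1], h)
```

Note that the angular equation never has to be solved separately: the conserved quantity $h$ enters only as a parameter, which is exactly the simplification that Equation (9.32) provides.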



9.6 Atwood Machines 



An Atwood machine consists of two weights, of mass $m_1$ and $m_2$, connected by a light
inextensible cord of length $l$, which passes over a pulley of radius $a \ll l$, and moment of
inertia $I$. See Figure 9.1.

Referring to the diagram, we can see that this is a one degree of freedom system whose
instantaneous configuration is specified by the coordinate $x$. Assuming that the cord does
not slip with respect to the pulley, the angular velocity of the pulley is $\dot x/a$. Hence, the kinetic
energy of the system is given by

$$K = \frac{1}{2}\,m_1\,\dot x^2 + \frac{1}{2}\,m_2\,\dot x^2 + \frac{1}{2}\,I\,\frac{\dot x^2}{a^2}. \tag{9.33}$$
Figure 9.1: An Atwood machine.



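The potential energy of the system takes the form

$$U = -m_1\,g\,x - m_2\,g\,(l - x). \tag{9.34}$$

It follows that the Lagrangian is written

$$L = \frac{1}{2}\left(m_1 + m_2 + \frac{I}{a^2}\right)\dot x^2 + g\,(m_1 - m_2)\,x + \mathrm{const}. \tag{9.35}$$

The equation of motion,

$$\frac{d}{dt}\!\left(\frac{\partial L}{\partial \dot x}\right) - \frac{\partial L}{\partial x} = 0, \tag{9.36}$$

thus yields

$$\left(m_1 + m_2 + \frac{I}{a^2}\right)\ddot x - g\,(m_1 - m_2) = 0, \tag{9.37}$$

or

$$\ddot x = \frac{g\,(m_1 - m_2)}{m_1 + m_2 + I/a^2}, \tag{9.38}$$

which is the correct answer.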
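Equation (9.38) is simple enough to evaluate directly. The short Python sketch below does so; the function name, the default value of $g$, and the sample masses are assumptions chosen only for illustration. It also makes two limiting cases easy to see: equal masses give zero acceleration, and a pulley with non-zero moment of inertia reduces the acceleration.

```python
def atwood_acceleration(m1, m2, I=0.0, a=1.0, g=9.81):
    """Acceleration of the coordinate x from Equation (9.38).

    m1, m2 : masses of the two weights
    I, a   : moment of inertia and radius of the pulley
    g      : gravitational acceleration (default value is an assumption)
    """
    return g * (m1 - m2) / (m1 + m2 + I / a**2)

# Illustrative cases (hypothetical values).
massless_pulley = atwood_acceleration(2.0, 1.0)             # g / 3
balanced = atwood_acceleration(1.0, 1.0)                    # no net driving force
heavy_pulley = atwood_acceleration(2.0, 1.0, I=1.0, a=1.0)  # g / 4
```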

Consider the dynamical system drawn in Figure 9.2. This is an Atwood machine in
which one of the weights has been replaced by a second Atwood machine with a cord of
length $l'$. The system now has two degrees of freedom, and its instantaneous position is
specified by the two coordinates $x$ and $x'$, as shown.
Figure 9.2: A double Atwood machine.

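For the sake of simplicity, let us neglect the masses of the two pulleys. Thus, the kinetic
energy of the system is written

$$K = \frac{1}{2}\,m_1\,\dot x^2 + \frac{1}{2}\,m_2\,(-\dot x + \dot x')^2 + \frac{1}{2}\,m_3\,(-\dot x - \dot x')^2, \tag{9.39}$$

whereas the potential energy takes the form

$$U = -m_1\,g\,x - m_2\,g\,(l - x + x') - m_3\,g\,(l - x + l' - x'). \tag{9.40}$$

It follows that the Lagrangian of the system is

$$L = \frac{1}{2}\,m_1\,\dot x^2 + \frac{1}{2}\,m_2\,(-\dot x + \dot x')^2 + \frac{1}{2}\,m_3\,(-\dot x - \dot x')^2 + g\,(m_1 - m_2 - m_3)\,x + g\,(m_2 - m_3)\,x' + \mathrm{const}. \tag{9.41}$$

Hence, the equations of motion,

$$\frac{d}{dt}\!\left(\frac{\partial L}{\partial \dot x}\right) - \frac{\partial L}{\partial x} = 0, \tag{9.42}$$

$$\frac{d}{dt}\!\left(\frac{\partial L}{\partial \dot x'}\right) - \frac{\partial L}{\partial x'} = 0, \tag{9.43}$$

yield

$$m_1\,\ddot x + m_2\,(\ddot x - \ddot x') + m_3\,(\ddot x + \ddot x') - g\,(m_1 - m_2 - m_3) = 0, \tag{9.44}$$

$$m_2\,(-\ddot x + \ddot x') + m_3\,(\ddot x + \ddot x') - g\,(m_2 - m_3) = 0. \tag{9.45}$$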
Figure 9.3: A sliding plane.

The accelerations $\ddot x$ and $\ddot x'$ can be obtained from the above two equations via simple
algebra.
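The "simple algebra" amounts to solving two linear equations in the two unknown accelerations. Collecting terms in Equations (9.44) and (9.45) gives a symmetric 2 x 2 system, which the following Python sketch solves by Cramer's rule. The function name, the default value of $g$, and the sample masses are assumptions made for illustration only.

```python
def double_atwood_accelerations(m1, m2, m3, g=9.81):
    """Solve Equations (9.44) and (9.45) for the accelerations of x and x'.

    Collecting terms, the two equations read
        (m1 + m2 + m3) * a  + (m3 - m2) * ap = g * (m1 - m2 - m3),
        (m3 - m2)      * a  + (m2 + m3) * ap = g * (m2 - m3),
    where a and ap denote the second time derivatives of x and x'.
    """
    a11 = m1 + m2 + m3
    a12 = m3 - m2
    a22 = m2 + m3
    b1 = g * (m1 - m2 - m3)
    b2 = g * (m2 - m3)
    det = a11 * a22 - a12 * a12      # equals m1*(m2 + m3) + 4*m2*m3 > 0
    a = (b1 * a22 - a12 * b2) / det
    ap = (a11 * b2 - a12 * b1) / det
    return a, ap

# Illustrative case (hypothetical values): equal weights on the lower machine.
a_sym, ap_sym = double_atwood_accelerations(3.0, 1.0, 1.0)

# A generic case, used to check that both equations of motion are satisfied.
m1, m2, m3, g = 2.0, 1.5, 0.5, 9.81
a_gen, ap_gen = double_atwood_accelerations(m1, m2, m3, g)
residual_44 = m1 * a_gen + m2 * (a_gen - ap_gen) + m3 * (a_gen + ap_gen) - g * (m1 - m2 - m3)
residual_45 = m2 * (-a_gen + ap_gen) + m3 * (a_gen + ap_gen) - g * (m2 - m3)
```

When $m_2 = m_3$ the lower machine is balanced, so the acceleration of $x'$ vanishes and the system reduces to a single Atwood machine with weights $m_1$ and $m_2 + m_3$.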

9.7 Sliding down a Sliding Plane 

Consider the case of a particle of mass m sliding down a smooth inclined plane of mass M 
which is, itself, free to slide on a smooth horizontal surface, as shown in Figure 9.3. This is 
a two degree of freedom system, so we need two coordinates to specify the configuration. 
Let us choose x, the horizontal distance of the plane from some reference point, and x', 
the parallel displacement of the particle from some reference point on the plane. 

Defining $x$- and $y$-axes, as shown in the diagram, the $x$- and $y$-components of the
particle's velocity are clearly given by

$$v_x = \dot x + \dot x'\,\cos\theta, \tag{9.46}$$

$$v_y = -\dot x'\,\sin\theta, \tag{9.47}$$

respectively, where $\theta$ is the angle of inclination of the plane with respect to the horizontal.
Thus,

$$v^2 = v_x^{\,2} + v_y^{\,2} = \dot x^2 + 2\,\dot x\,\dot x'\,\cos\theta + \dot x'^{\,2}. \tag{9.48}$$

Hence, the kinetic energy of the system takes the form

$$K = \frac{1}{2}\,M\,\dot x^2 + \frac{1}{2}\,m\,(\dot x^2 + 2\,\dot x\,\dot x'\,\cos\theta + \dot x'^{\,2}), \tag{9.49}$$

whereas the potential energy is given by

$$U = -m\,g\,x'\,\sin\theta + \mathrm{const}. \tag{9.50}$$

It follows that the Lagrangian is written

$$L = \frac{1}{2}\,M\,\dot x^2 + \frac{1}{2}\,m\,(\dot x^2 + 2\,\dot x\,\dot x'\,\cos\theta + \dot x'^{\,2}) + m\,g\,x'\,\sin\theta + \mathrm{const}. \tag{9.51}$$
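The equations of motion,

$$\frac{d}{dt}\!\left(\frac{\partial L}{\partial \dot x}\right) - \frac{\partial L}{\partial x} = 0, \tag{9.52}$$

$$\frac{d}{dt}\!\left(\frac{\partial L}{\partial \dot x'}\right) - \frac{\partial L}{\partial x'} = 0, \tag{9.53}$$

thus yield

$$M\,\ddot x + m\,(\ddot x + \ddot x'\,\cos\theta) = 0, \tag{9.54}$$

$$m\,(\ddot x' + \ddot x\,\cos\theta) - m\,g\,\sin\theta = 0. \tag{9.55}$$

Finally, solving for $\ddot x$ and $\ddot x'$, we obtain

$$\ddot x = -\frac{g\,\sin\theta\,\cos\theta}{(m + M)/m - \cos^2\theta}, \tag{9.56}$$

$$\ddot x' = \frac{g\,\sin\theta}{1 - m\,\cos^2\theta/(m + M)}. \tag{9.57}$$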
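A quick way to gain confidence in Equations (9.56) and (9.57) is to substitute them back into Equations (9.54) and (9.55) numerically. The Python sketch below does this; the function name, the default value of $g$, and the sample parameters are illustrative assumptions. It also examines the limit of a very massive plane, in which the particle should slide with the familiar acceleration $g\,\sin\theta$ while the plane barely moves.

```python
import math

def sliding_plane_accelerations(m, M, theta, g=9.81):
    """Accelerations of x (plane) and x' (particle along plane), Eqs. (9.56)-(9.57)."""
    c = math.cos(theta)
    s = math.sin(theta)
    x_dd = -g * s * c / ((m + M) / m - c * c)
    xp_dd = g * s / (1.0 - m * c * c / (m + M))
    return x_dd, xp_dd

# Illustrative parameters (hypothetical values).
m, M, theta, g = 1.0, 3.0, math.radians(30.0), 9.81
x_dd, xp_dd = sliding_plane_accelerations(m, M, theta, g)

# Residuals of the equations of motion (9.54) and (9.55); both should vanish.
residual_54 = M * x_dd + m * (x_dd + xp_dd * math.cos(theta))
residual_55 = m * (xp_dd + x_dd * math.cos(theta)) - m * g * math.sin(theta)

# Limit of a very heavy plane: the plane is almost stationary.
x_dd_heavy, xp_dd_heavy = sliding_plane_accelerations(m, 1.0e9, theta, g)
```

The vanishing of `residual_54` is simply the statement that the total horizontal momentum of the plane and particle does not change.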

9.8 Generalized Momenta 

Consider the motion of a single particle moving in one dimension. The kinetic energy is

$$K = \frac{1}{2}\,m\,\dot x^2, \tag{9.58}$$

where $m$ is the mass of the particle, and $x$ its displacement. Now, the particle's linear
momentum is $p = m\,\dot x$. However, this can also be written

$$p = \frac{\partial K}{\partial \dot x} = \frac{\partial L}{\partial \dot x}, \tag{9.59}$$

since $L = K - U$, and the potential energy $U$ is independent of $\dot x$.

Consider a dynamical system described by $\mathcal{F}$ generalized coordinates $q_i$, for $i = 1, \mathcal{F}$.
By analogy with the above expression, we can define generalized momenta of the form

$$p_i = \frac{\partial L}{\partial \dot q_i}, \tag{9.60}$$

for $i = 1, \mathcal{F}$. Here, $p_i$ is sometimes called the momentum conjugate to the coordinate $q_i$.
Hence, Lagrange's equation (9.22) can be written

$$\frac{dp_i}{dt} = \frac{\partial L}{\partial q_i}, \tag{9.61}$$

for $i = 1, \mathcal{F}$. Note that a generalized momentum does not necessarily have the dimensions
of linear momentum.

Suppose that the Lagrangian $L$ does not depend explicitly on some coordinate $q_k$. It
follows from Equation (9.61) that

$$\frac{dp_k}{dt} = \frac{\partial L}{\partial q_k} = 0. \tag{9.62}$$

Hence,

$$p_k = \mathrm{const}. \tag{9.63}$$

The coordinate $q_k$ is said to be ignorable in this case. Thus, we conclude that the
generalized momentum associated with an ignorable coordinate is a constant of the motion.

For example, in Section 9.5, the Lagrangian (9.24) for a particle moving in a central
potential is independent of the angular coordinate $\theta$. Thus, $\theta$ is an ignorable coordinate,
and

$$p_\theta = \frac{\partial L}{\partial \dot\theta} = m\,r^2\,\dot\theta \tag{9.64}$$

is a constant of the motion. Of course, $p_\theta$ is the angular momentum about the origin. This
is conserved because a central force exerts no torque about the origin.

Again, in Section 9.7, the Lagrangian (9.51) for a mass sliding down a sliding slope
is independent of the Cartesian coordinate $x$. It follows that $x$ is an ignorable coordinate,
and

$$p_x = \frac{\partial L}{\partial \dot x} = M\,\dot x + m\,(\dot x + \dot x'\,\cos\theta) \tag{9.65}$$

is a constant of the motion. Of course, $p_x$ is the total linear momentum in the $x$-direction.
This is conserved because there is no external force acting on the system in the $x$-direction.



9.9 Spherical Pendulum 

Consider a pendulum consisting of a compact mass $m$ on the end of a light inextensible string
of length $l$. Suppose that the mass is free to move in any direction (as long as the string
remains taut). Let the fixed end of the string be located at the origin of our coordinate
system. We can define Cartesian coordinates, $(x, y, z)$, such that the $z$-axis points vertically
upward. We can also define spherical coordinates, $(r, \theta, \phi)$, whose axis points along
the $-z$-axis. The latter coordinates are the most convenient, since $r$ is constrained to
always take the value $l$. However, the two angular coordinates, $\theta$ and $\phi$, are free to vary
independently. Hence, this is a two degree of freedom system.

The Cartesian coordinates can be written in terms of the angular coordinates $\theta$ and $\phi$.
In fact,

$$x = l\,\sin\theta\,\cos\phi, \tag{9.66}$$

$$y = l\,\sin\theta\,\sin\phi, \tag{9.67}$$

$$z = -l\,\cos\theta. \tag{9.68}$$
Hence, the potential energy of the system is

$$U = m\,g\,z = -m\,g\,l\,\cos\theta. \tag{9.69}$$

Also,

$$v^2 = \dot x^2 + \dot y^2 + \dot z^2 = l^2\,(\dot\theta^2 + \sin^2\theta\,\dot\phi^2). \tag{9.70}$$

Thus, the Lagrangian of the system is written

$$L = \frac{1}{2}\,m\,l^2\,(\dot\theta^2 + \sin^2\theta\,\dot\phi^2) + m\,g\,l\,\cos\theta. \tag{9.71}$$

Note that the Lagrangian is independent of the angular coordinate $\phi$. It follows that

$$p_\phi = \frac{\partial L}{\partial \dot\phi} = m\,l^2\,\sin^2\theta\,\dot\phi \tag{9.72}$$

is a constant of the motion. Of course, $p_\phi$ is the angular momentum of the system about
the $z$-axis. This is conserved because neither the tension in the string nor the force of
gravity exerts a torque about the $z$-axis. Conservation of angular momentum about the
$z$-axis implies that

$$\sin^2\theta\,\dot\phi = h, \tag{9.73}$$

where $h$ is a constant.

The equation of motion of the system,

$$\frac{d}{dt}\!\left(\frac{\partial L}{\partial \dot\theta}\right) - \frac{\partial L}{\partial \theta} = 0, \tag{9.74}$$

yields

$$\ddot\theta + \frac{g}{l}\,\sin\theta - \sin\theta\,\cos\theta\,\dot\phi^2 = 0, \tag{9.75}$$

or

$$\ddot\theta + \frac{g}{l}\,\sin\theta - h^2\,\frac{\cos\theta}{\sin^3\theta} = 0, \tag{9.76}$$

where use has been made of Equation (9.73).

Suppose that $\phi = \phi_0 = \mathrm{const}$. It follows that $\dot\phi = h = 0$. Hence, Equation (9.76) yields

$$\ddot\theta + \frac{g}{l}\,\sin\theta = 0. \tag{9.77}$$

This, of course, is the equation of a simple pendulum whose motion is restricted to the
vertical plane $\phi = \phi_0$ (see Section 3.10).

Suppose that $\theta = \theta_0 = \mathrm{const}$. It follows from Equation (9.73) that $\dot\phi = \dot\phi_0 = \mathrm{const}$: i.e.,
the pendulum bob rotates uniformly in a horizontal plane. According to Equations (9.73)
and (9.76),

$$\dot\phi_0 = \sqrt{\frac{g}{d}}, \tag{9.78}$$
where $d = l\,\cos\theta_0$ is the vertical distance of the plane of rotation below the pivot point.
This type of pendulum is usually called a conical pendulum, since the string attached to the
pendulum bob sweeps out a cone as the bob rotates.

Suppose, finally, that the motion is almost conical: i.e., the value of $\theta$ remains close to
the value $\theta_0$. Let

$$\theta = \theta_0 + \delta\theta. \tag{9.79}$$

Taylor expanding Equation (9.76) to first order in $\delta\theta$, the zeroth order terms cancel out,
and we are left with

$$\delta\ddot\theta + \dot\phi_0^{\,2}\,(1 + 3\,\cos^2\theta_0)\,\delta\theta \simeq 0. \tag{9.80}$$

Hence, solving the above equation, we obtain

$$\theta \simeq \theta_0 + \delta\theta_0\,\cos(\Omega\,t), \tag{9.81}$$

where

$$\Omega = \dot\phi_0\,\sqrt{1 + 3\,\cos^2\theta_0}. \tag{9.82}$$

Thus, the angle $\theta$ executes simple harmonic motion about its mean value $\theta_0$ at the angular
frequency $\Omega$.

Now the azimuthal angle, $\phi$, increases by

$$\Delta\phi \simeq \dot\phi_0\,\frac{\pi}{\Omega} = \frac{\pi}{\sqrt{1 + 3\,\cos^2\theta_0}} \tag{9.83}$$

as the angle of inclination to the vertical, $\theta$, goes between successive maxima and minima.
Suppose that $\theta_0$ is small. In this case, $\Delta\phi$ is slightly greater than $\pi/2$. Now if $\Delta\phi$ were
exactly $\pi/2$ then the pendulum bob would trace out the outline of a slightly warped circle:
i.e., something like the outline of a potato chip or a saddle. The fact that $\Delta\phi$ is slightly
greater than $\pi/2$ means that this shape precesses about the $z$-axis in the same direction as
the direction of rotation of the bob. The precession rate increases as the angle of inclination
$\theta_0$ increases. Suppose, now, that $\theta_0$ is slightly less than $\pi/2$. (Of course, $\theta_0$ can never
exceed $\pi/2$.) In this case, $\Delta\phi$ is slightly less than $\pi$. Now if $\Delta\phi$ were exactly $\pi$ then
the pendulum bob would trace out the outline of a slightly tilted circle. The fact that
$\Delta\phi$ is slightly less than $\pi$ means that this shape precesses about the $z$-axis in the opposite
direction to the direction of rotation of the bob. The precession rate increases as the angle
of inclination $\theta_0$ decreases below $\pi/2$.
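The two limiting cases discussed above can be checked by evaluating Equations (9.78), (9.82), and (9.83) directly. The following Python sketch is only illustrative: the function name, the default values of $l$ and $g$, and the sample angles are assumptions.

```python
import math

def nearly_conical_pendulum(theta0, l=1.0, g=9.81):
    """Return (phi_dot0, Omega, delta_phi) from Equations (9.78), (9.82), (9.83).

    theta0 : mean inclination of the string to the downward vertical (radians),
             assumed to lie strictly between 0 and pi/2.
    """
    d = l * math.cos(theta0)                   # depth of plane of rotation below pivot
    phi_dot0 = math.sqrt(g / d)                # Equation (9.78)
    factor = math.sqrt(1.0 + 3.0 * math.cos(theta0) ** 2)
    Omega = phi_dot0 * factor                  # Equation (9.82)
    delta_phi = math.pi / factor               # Equation (9.83)
    return phi_dot0, Omega, delta_phi

# Illustrative angles (hypothetical values): nearly vertical and nearly horizontal.
_, _, dphi_small = nearly_conical_pendulum(1.0e-3)
_, _, dphi_large = nearly_conical_pendulum(math.pi / 2.0 - 1.0e-3)
_, _, dphi_mid = nearly_conical_pendulum(math.radians(45.0))
```

For intermediate inclinations $\Delta\phi$ lies between $\pi/2$ and $\pi$; at $\theta_0 = 45^\circ$, for instance, Equation (9.83) gives $\pi/\sqrt{5/2}$.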



9.10 Exercises 

9.1. A horizontal rod $AB$ rotates with constant angular velocity $\omega$ about its midpoint $O$. A particle
$P$ is attached to it by equal strings $AP$, $BP$. If $\theta$ is the inclination of the plane $APB$ to the
vertical, prove that

$$\frac{d^2\theta}{dt^2} - \omega^2\,\sin\theta\,\cos\theta = -\frac{g}{l}\,\sin\theta,$$

where $l = OP$. Deduce the condition that the vertical position of $OP$ should be stable.
9.2. A double pendulum consists of two simple pendula, with one pendulum suspended from the
bob of the other. If the two pendula have equal lengths, $l$, and have bobs of equal mass,
$m$, and if both pendula are confined to move in the same vertical plane, find Lagrange's
equations of motion for the system. Use $\theta$ and $\phi$ (the angles the upper and lower pendulums
make with the downward vertical, respectively) as the generalized coordinates. Do not
assume small angles.

9.3. The surface of the Diskworld is a disk of radius $R$ which rotates uniformly about a
perpendicular axis passing through its center with angular velocity $\Omega$. Diskworld gravitational
acceleration is of magnitude $g$, and is everywhere directed normal to the disk. Find the
Lagrangian of a projectile of mass $m$ using co-rotating cylindrical polar coordinates as the
generalized coordinates. What are the momenta conjugate to each coordinate? Are any of
these momenta conserved? Find Lagrange's equations of motion for the projectile.

9.4. Find Lagrange's equations of motion for an elastic pendulum consisting of a particle of mass
$m$ attached to an elastic string of stiffness $k$ and unstretched length $l_0$. Assume that the
motion takes place in a vertical plane.

9.5. A disk of mass $M$ and radius $R$ rolls without slipping down a plane inclined at an angle $\alpha$
to the horizontal. The disk has a short weightless axle of negligible radius. From this axle
is suspended a simple pendulum of length $l < R$ whose bob is of mass $m$. Assume that the
motion of the pendulum takes place in the plane of the disk. Find Lagrange's equations of
motion of the system.

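[Diagram for Exercise 9.6: a hoop with center $C$, pivoted at the point $P$ on its circumference, carrying a bead of mass $m$; horizontal axis $x$, vertical axis $z$.]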



9.6. A vertical circular hoop of radius $a$ is rotated in a vertical plane about a point $P$ on its
circumference at the constant angular velocity $\omega$. A bead of mass $m$ slides without friction
on the hoop. Find the kinetic energy, the potential energy, the Lagrangian, and Lagrange's
equation of motion of the bead, respectively, in terms of the angular coordinate $\theta$ shown
in the above diagram. Here, $x$ is a horizontal Cartesian coordinate, $z$ a vertical Cartesian
coordinate, and $C$ the center of the hoop.

9.7. Consider a spherical pendulum of length $l$. Suppose that the string is initially horizontal,
and the bob is rotating horizontally with tangential velocity $v$. Demonstrate that, at its lowest
subsequent point, the bob will have fallen a vertical height $l\,e^{-u}$, where

$$\sinh u = \frac{v^2}{4\,g\,l}.$$

Show that if $v^2$ is large compared to $4\,g\,l$ then this result becomes approximately $2\,g\,l^2/v^2$.
9.8. The kinetic energy of a rotating rigid object with an axis of symmetry can be written

$$K = \frac{1}{2}\left[I_\perp\,\dot\theta^2 + (I_\perp\,\sin^2\theta + I_\parallel\,\cos^2\theta)\,\dot\phi^2 + 2\,I_\parallel\,\cos\theta\,\dot\phi\,\dot\psi + I_\parallel\,\dot\psi^2\right],$$

where $I_\parallel$ is the moment of inertia about the symmetry axis, $I_\perp$ is the moment of inertia
about an axis perpendicular to the symmetry axis, and $\theta$, $\phi$, $\psi$ are the three Euler angles.
Suppose that the object is rotating freely. Find the momenta conjugate to the Euler angles.
Which of these momenta are conserved? Find Lagrange's equations of motion for the system.
Demonstrate that if the system is precessing steadily (which implies that $\theta$, $\dot\phi$, and $\dot\psi$ are
constants) then

$$\dot\psi = \left(\frac{I_\perp - I_\parallel}{I_\parallel}\right)\cos\theta\,\dot\phi.$$



9.9. Consider a nonconservative system in which the dissipative forces take the form $f_i = -k_i\,\dot x_i$,
where the $x_i$ are Cartesian coordinates, and the $k_i$ are all positive. Demonstrate that the
dissipative forces can be incorporated into the Lagrangian formalism provided that Lagrange's
equations of motion are modified to read

$$\frac{d}{dt}\!\left(\frac{\partial L}{\partial \dot q_i}\right) - \frac{\partial L}{\partial q_i} + \frac{\partial R}{\partial \dot q_i} = 0,$$

where

$$R = \frac{1}{2} \sum_i k_i\,\dot x_i^{\,2}$$

is termed the Rayleigh Dissipation Function.
10 Hamiltonian Dynamics



10.1 Introduction



This chapter investigates the application of variational principles to Newtonian dynamics. 



10.2 Calculus of Variations 



It is a well-known fact, first enunciated by Archimedes, that the shortest distance between
two points in a plane is a straight line. However, suppose that we wish to demonstrate this
result from first principles. Let us consider the length, $l$, of various curves, $y(x)$, which run
between two fixed points, $A$ and $B$, in a plane, as illustrated in Figure 10.1. Now, $l$ takes
the form

$$l = \int_A^B [dx^2 + dy^2]^{1/2} = \int_a^b [1 + y'^{\,2}(x)]^{1/2}\,dx, \tag{10.1}$$

where $y' = dy/dx$. Note that $l$ is a function of the function $y(x)$. In mathematics, a
function of a function is termed a functional.



Figure 10.1: Different paths between points A and B.

Now, in order to find the shortest path between points $A$ and $B$, we need to minimize the
functional $l$ with respect to small variations in the function $y(x)$, subject to the constraint
that the end points, $A$ and $B$, remain fixed. In other words, we need to solve

$$\delta l = 0. \tag{10.2}$$

The meaning of the above equation is that if $y(x) \rightarrow y(x) + \delta y(x)$, where $\delta y(x)$ is small,
then the first-order variation in $l$, denoted $\delta l$, vanishes. In other words, $l \rightarrow l + \mathcal{O}(\delta y^2)$. The
particular function $y(x)$ for which $\delta l = 0$ obviously yields an extremum of $l$ (i.e., either a
maximum or a minimum). Hopefully, in the case under consideration, it yields a minimum
of $l$.

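Consider a general functional of the form

$$I = \int_a^b F(y, y', x)\,dx, \tag{10.3}$$

where the end points of the integration are fixed. Suppose that $y(x) \rightarrow y(x) + \delta y(x)$. The
first-order variation in $I$ is written

$$\delta I = \int_a^b \left(\frac{\partial F}{\partial y}\,\delta y + \frac{\partial F}{\partial y'}\,\delta y'\right) dx, \tag{10.4}$$

where $\delta y' = d(\delta y)/dx$. Setting $\delta I$ to zero, we obtain

$$\int_a^b \left(\frac{\partial F}{\partial y}\,\delta y + \frac{\partial F}{\partial y'}\,\delta y'\right) dx = 0. \tag{10.5}$$

This equation must be satisfied for all possible small perturbations $\delta y(x)$.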

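Integrating the second term in the integrand of the above equation by parts, we get

$$\int_a^b \left[\frac{\partial F}{\partial y} - \frac{d}{dx}\!\left(\frac{\partial F}{\partial y'}\right)\right] \delta y\,dx + \left[\frac{\partial F}{\partial y'}\,\delta y\right]_a^b = 0. \tag{10.6}$$

Now, if the end points are fixed then $\delta y = 0$ at $x = a$ and $x = b$. Hence, the last term on
the left-hand side of the above equation is zero. Thus, we obtain

$$\int_a^b \left[\frac{\partial F}{\partial y} - \frac{d}{dx}\!\left(\frac{\partial F}{\partial y'}\right)\right] \delta y\,dx = 0. \tag{10.7}$$

The above equation must be satisfied for all small perturbations $\delta y(x)$. The only way in
which this is possible is for the expression enclosed in square brackets in the integral to be
zero. Hence, the functional $I$ attains an extremum value whenever

$$\frac{d}{dx}\!\left(\frac{\partial F}{\partial y'}\right) - \frac{\partial F}{\partial y} = 0. \tag{10.8}$$

This condition is known as the Euler-Lagrange equation.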

Let us consider some special cases. Suppose that $F$ does not explicitly depend on $y$. It
follows that $\partial F/\partial y = 0$. Hence, the Euler-Lagrange equation (10.8) simplifies to

$$\frac{\partial F}{\partial y'} = \mathrm{const}. \tag{10.9}$$
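Next, suppose that $F$ does not depend explicitly on $x$. Multiplying Equation (10.8) by $y'$,
we obtain

$$y'\,\frac{d}{dx}\!\left(\frac{\partial F}{\partial y'}\right) - y'\,\frac{\partial F}{\partial y} = 0. \tag{10.10}$$

However,

$$\frac{d}{dx}\!\left(y'\,\frac{\partial F}{\partial y'}\right) = y'\,\frac{d}{dx}\!\left(\frac{\partial F}{\partial y'}\right) + y''\,\frac{\partial F}{\partial y'}. \tag{10.11}$$

Thus, we get

$$\frac{d}{dx}\!\left(y'\,\frac{\partial F}{\partial y'}\right) = y'\,\frac{\partial F}{\partial y} + y''\,\frac{\partial F}{\partial y'}. \tag{10.12}$$

Now, if $F$ is not an explicit function of $x$ then the right-hand side of the above equation is
the total derivative of $F$, namely $dF/dx$. Hence, we obtain

$$\frac{d}{dx}\!\left(y'\,\frac{\partial F}{\partial y'}\right) = \frac{dF}{dx}, \tag{10.13}$$

which yields

$$y'\,\frac{\partial F}{\partial y'} - F = \mathrm{const}. \tag{10.14}$$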

Returning to the case under consideration, we have $F = \sqrt{1 + y'^{\,2}}$, according to
Equations (10.1) and (10.3). Hence, $F$ is not an explicit function of $y$, so Equation (10.9) yields

$$\frac{\partial F}{\partial y'} = \frac{y'}{\sqrt{1 + y'^{\,2}}} = c, \tag{10.15}$$

where $c$ is a constant. So,

$$y' = \frac{c}{\sqrt{1 - c^2}} = \mathrm{const}. \tag{10.16}$$

Of course, $y' = \mathrm{constant}$ is the equation of a straight line. Thus, the shortest distance
between two fixed points in a plane is indeed a straight line.
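The statement that the first-order variation of the length vanishes for a straight line can be illustrated numerically. The Python sketch below is a rough check under stated assumptions: the end points $(0, 0)$ and $(1, 2)$, the perturbation $\epsilon\,\sin(\pi x)$ (which vanishes at both end points), and the function names are all chosen purely for illustration. The length is approximated by summing short chords. If the straight line is an extremum then the excess length should scale as $\epsilon^2$, so doubling $\epsilon$ should roughly quadruple the excess.

```python
import math

def path_length(y, a, b, n=2000):
    """Approximate the length of the curve y(x) on [a, b] by summing n chords."""
    dx = (b - a) / n
    total = 0.0
    for k in range(n):
        x0 = a + k * dx
        x1 = x0 + dx
        total += math.hypot(dx, y(x1) - y(x0))
    return total

def straight(x):
    # Straight line joining the assumed end points (0, 0) and (1, 2).
    return 2.0 * x

def perturbed(eps):
    # Same end points, since sin(pi * x) vanishes at x = 0 and x = 1.
    return lambda x: 2.0 * x + eps * math.sin(math.pi * x)

l_straight = path_length(straight, 0.0, 1.0)
excess_1 = path_length(perturbed(0.01), 0.0, 1.0) - l_straight
excess_2 = path_length(perturbed(0.02), 0.0, 1.0) - l_straight
ratio = excess_2 / excess_1
```

A positive excess for both signs and sizes of perturbation is consistent with the straight line being a minimum, rather than a maximum, of the length.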

10.3 Conditional Variation 

Suppose that we wish to find the function $y(x)$ which maximizes or minimizes the
functional

$$I = \int_a^b F(y, y', x)\,dx, \tag{10.17}$$

subject to the constraint that the value of

$$J = \int_a^b G(y, y', x)\,dx \tag{10.18}$$

remains constant. We can achieve our goal by finding an extremum of the new functional
$K = I + \lambda\,J$, where $\lambda(x)$ is an undetermined function. We know that $\delta J = 0$, since the value
of $J$ is fixed, so if $\delta K = 0$ then $\delta I = 0$ as well. In other words, finding an extremum of $K$ is
equivalent to finding an extremum of $I$. Application of the Euler-Lagrange equation yields

$$\frac{d}{dx}\!\left(\frac{\partial F}{\partial y'}\right) - \frac{\partial F}{\partial y} + \left[\frac{d}{dx}\!\left(\frac{\partial [\lambda G]}{\partial y'}\right) - \frac{\partial [\lambda G]}{\partial y}\right] = 0. \tag{10.19}$$

In principle, the above equation, together with the constraint (10.18), yields the functions
$\lambda(x)$ and $y(x)$. Incidentally, $\lambda$ is generally termed a Lagrange multiplier. If $F$ and $G$ have no
explicit $x$-dependence then $\lambda$ is usually a constant.

As an example, consider the following famous problem. Suppose that a uniform chain
of fixed length $l$ is suspended by its ends from two equal-height fixed points which are a
distance $a$ apart, where $a < l$. What is the equilibrium configuration of the chain?

Suppose that the chain has the uniform density per unit length $\rho$. Let the $x$- and
$y$-axes be horizontal and vertical, respectively, and let the two ends of the chain lie at
$(\pm a/2, 0)$. The equilibrium configuration of the chain is specified by the function $y(x)$, for
$-a/2 \leq x \leq +a/2$, where $y(x)$ is the vertical distance of the chain below its end points at
horizontal position $x$. Of course, $y(-a/2) = y(+a/2) = 0$.

According to the discussion in Section 3.2, the stable equilibrium state of a conservative
dynamical system is one which minimizes the system's potential energy. Now, the potential
energy of the chain is written

$$U = -\rho\,g \int y\,ds = -\rho\,g \int_{-a/2}^{a/2} y\,[1 + y'^{\,2}]^{1/2}\,dx, \tag{10.20}$$

where $ds = \sqrt{dx^2 + dy^2}$ is an element of length along the chain, and $g$ is the acceleration
due to gravity. Hence, we need to minimize $U$ with respect to small variations in $y(x)$.
However, the variations in $y(x)$ must be such as to conserve the fixed length of the chain.
Hence, our minimization procedure is subject to the constraint that

$$l = \int ds = \int_{-a/2}^{a/2} [1 + y'^{\,2}]^{1/2}\,dx \tag{10.21}$$

remains constant.

It follows, from the above discussion, that we need to minimize the functional

$$K = U + \lambda\,l = \int_{-a/2}^{a/2} (-\rho\,g\,y + \lambda)\,[1 + y'^{\,2}]^{1/2}\,dx, \tag{10.22}$$

where $\lambda$ is an, as yet, undetermined constant. Since the integrand in the functional does
not depend explicitly on $x$, we have from Equation (10.14) that

$$y'^{\,2}\,(-\rho\,g\,y + \lambda)\,[1 + y'^{\,2}]^{-1/2} - (-\rho\,g\,y + \lambda)\,[1 + y'^{\,2}]^{1/2} = k, \tag{10.23}$$
where $k$ is a constant. This expression reduces to

$$y'^{\,2} = \left(\lambda' + \frac{y}{h}\right)^2 - 1, \tag{10.24}$$

where $\lambda' = \lambda/k$, and $h = -k/(\rho\,g)$.
Let

$$\lambda' + \frac{y}{h} = -\cosh z. \tag{10.25}$$

Making this substitution, Equation (10.24) yields

$$\frac{dz}{dx} = -h^{-1}. \tag{10.26}$$

Hence,

$$z = -\frac{x}{h} + c, \tag{10.27}$$

where $c$ is a constant. It follows from Equation (10.25) that

$$y(x) = -h\,[\lambda' + \cosh(-x/h + c)]. \tag{10.28}$$


The above solution contains three undetermined constants, $h$, $\lambda'$, and $c$. We can eliminate
two of these constants by application of the boundary conditions $y(\pm a/2) = 0$. This yields

$$\lambda' + \cosh(\mp a/2h + c) = 0. \tag{10.29}$$

Hence, $c = 0$, and $\lambda' = -\cosh(a/2h)$. It follows that

$$y(x) = h\,[\cosh(a/2h) - \cosh(x/h)]. \tag{10.30}$$

The final unknown constant, $h$, is determined via the application of the constraint (10.21).
Thus,

$$l = \int_{-a/2}^{a/2} [1 + y'^{\,2}]^{1/2}\,dx = \int_{-a/2}^{a/2} \cosh(x/h)\,dx = 2\,h\,\sinh(a/2h). \tag{10.31}$$

Hence, the equilibrium configuration of the chain is given by the curve (10.30), which is
known as a catenary, where the parameter $h$ satisfies

$$\frac{l}{2\,h} = \sinh\!\left(\frac{a}{2\,h}\right). \tag{10.32}$$
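Equation (10.32) is transcendental, so in practice $h$ must be found numerically. The following Python sketch uses simple bisection on the function $2\,h\,\sinh(a/2h) - l$, which decreases monotonically from very large values (small $h$) towards $a - l < 0$ (large $h$). The function names, the bracketing strategy, the tolerance, and the sample values are assumptions made only for illustration; the lower bracket presumes that $l/a$ is not astronomically large.

```python
import math

def chain_length(h, a):
    """Length of the catenary (10.30) spanning a horizontal distance a, Eq. (10.31)."""
    return 2.0 * h * math.sinh(a / (2.0 * h))

def catenary_parameter(a, l, tol=1.0e-12):
    """Solve Equation (10.32) for h by bisection, assuming 0 < a < l."""
    if not (0.0 < a < l):
        raise ValueError("need 0 < a < l")
    lo = a / 100.0            # here chain_length is enormous (sinh(50) ~ 1e21)
    hi = a
    while chain_length(hi, a) > l:
        hi *= 2.0             # chain_length tends to a < l as h grows
    while hi - lo > tol * hi:
        mid = 0.5 * (lo + hi)
        if chain_length(mid, a) > l:
            lo = mid          # chain too long for this h: increase h
        else:
            hi = mid
    return 0.5 * (lo + hi)

def sag(h, a):
    """Depth of the lowest point of the chain below its end points, y(0) in (10.30)."""
    return h * (math.cosh(a / (2.0 * h)) - 1.0)

# Illustrative values (hypothetical): supports one unit apart, chain 1.5 units long.
a, l = 1.0, 1.5
h = catenary_parameter(a, l)
depth = sag(h, a)
```

A longer chain between the same supports corresponds to a smaller value of $h$ and a deeper sag.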
10.4 Multi-Function Variation

Suppose that we wish to maximize or minimize the functional

$$I = \int_a^b F(y_1, y_2, \cdots, y_{\mathcal{F}}, y_1', y_2', \cdots, y_{\mathcal{F}}', x)\,dx. \tag{10.33}$$

Here, the integrand $F$ is now a functional of the $\mathcal{F}$ independent functions $y_i(x)$, for $i = 1, \mathcal{F}$.
A fairly straightforward extension of the analysis in Section 10.2 yields $\mathcal{F}$ separate
Euler-Lagrange equations,

$$\frac{d}{dx}\!\left(\frac{\partial F}{\partial y_i'}\right) - \frac{\partial F}{\partial y_i} = 0, \tag{10.34}$$

for $i = 1, \mathcal{F}$, which determine the $\mathcal{F}$ functions $y_i(x)$. If $F$ does not explicitly depend on the
function $y_k$ then the $k$th Euler-Lagrange equation simplifies to

$$\frac{\partial F}{\partial y_k'} = \mathrm{const}. \tag{10.35}$$

Likewise, if $F$ does not explicitly depend on $x$ then the $\mathcal{F}$ Euler-Lagrange equations together
imply that

$$\sum_{i=1,\mathcal{F}} y_i'\,\frac{\partial F}{\partial y_i'} - F = \mathrm{const}. \tag{10.36}$$



10.5 Hamilton's Principle 

We saw, in Chapter 9, that we can specify the instantaneous configuration of a
conservative dynamical system with $\mathcal{F}$ degrees of freedom in terms of $\mathcal{F}$ independent generalized
coordinates $q_i$, for $i = 1, \mathcal{F}$. Let $K(q_1, q_2, \cdots, q_{\mathcal{F}}, \dot q_1, \dot q_2, \cdots, \dot q_{\mathcal{F}}, t)$ and $U(q_1, q_2, \cdots, q_{\mathcal{F}}, t)$
represent the kinetic and potential energies of the system, respectively, expressed in terms
of these generalized coordinates. Here, $\dot{\ } \equiv d/dt$. The Lagrangian of the system is defined

$$L(q_1, q_2, \cdots, q_{\mathcal{F}}, \dot q_1, \dot q_2, \cdots, \dot q_{\mathcal{F}}, t) = K - U. \tag{10.37}$$

Finally, the $\mathcal{F}$ Lagrangian equations of motion of the system take the form

$$\frac{d}{dt}\!\left(\frac{\partial L}{\partial \dot q_i}\right) - \frac{\partial L}{\partial q_i} = 0, \tag{10.38}$$

for $i = 1, \mathcal{F}$.

Note that the above equations of motion have exactly the same mathematical form as
the Euler-Lagrange equations (10.34). Indeed, it is clear, from Section 10.4, that the $\mathcal{F}$
Lagrangian equations of motion (10.38) can all be derived from a single equation: namely,

$$\delta \int_{t_1}^{t_2} L(q_1, q_2, \cdots, q_{\mathcal{F}}, \dot q_1, \dot q_2, \cdots, \dot q_{\mathcal{F}}, t)\,dt = 0. \tag{10.39}$$
In other words, the motion of the system in a given time interval is such as to maximize or
minimize the time integral of the Lagrangian, which is known as the action integral. Thus, 
the laws of Newtonian dynamics can be summarized in a single statement: 

The motion of a dynamical system in a given time interval is such as to maxi- 
mize or minimize the action integral. 

(In practice, the action integral is almost always minimized.) This statement is known as 
Hamilton's principle, and was first formulated in 1834 by the Irish mathematician William 
Hamilton. 



10.6 Constrained Lagrangian Dynamics 



Suppose that we have a dynamical system described by two generalized coordinates, $q_1$
and $q_2$. Suppose, further, that $q_1$ and $q_2$ are not independent variables. In other words, $q_1$
and $q_2$ are connected via some constraint equation of the form

$$f(q_1, q_2, t) = 0, \tag{10.40}$$

where $f$ is some function of three variables. This type of constraint is called holonomic. [A
general holonomic constraint is of the form $f(q_1, q_2, \cdots, q_{\mathcal{F}}, t) = 0$.] Let $L(q_1, q_2, \dot q_1, \dot q_2, t)$
be the Lagrangian. How do we write the Lagrangian equations of motion of the system?
Well, according to Hamilton's principle,

$$\delta \int_{t_1}^{t_2} L\,dt = \int_{t_1}^{t_2} \left\{ \left[\frac{\partial L}{\partial q_1} - \frac{d}{dt}\!\left(\frac{\partial L}{\partial \dot q_1}\right)\right] \delta q_1 + \left[\frac{\partial L}{\partial q_2} - \frac{d}{dt}\!\left(\frac{\partial L}{\partial \dot q_2}\right)\right] \delta q_2 \right\} dt = 0. \tag{10.41}$$



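However, at any given instant in time, $\delta q_1$ and $\delta q_2$ are not independent. Indeed,
Equation (10.40) yields

$$\delta f = \frac{\partial f}{\partial q_1}\,\delta q_1 + \frac{\partial f}{\partial q_2}\,\delta q_2 = 0 \tag{10.42}$$

at a fixed time. Eliminating $\delta q_2$ from Equation (10.41), we obtain

$$\int_{t_1}^{t_2} \left\{ \left[\frac{\partial L}{\partial q_1} - \frac{d}{dt}\!\left(\frac{\partial L}{\partial \dot q_1}\right)\right] \frac{1}{\partial f/\partial q_1} - \left[\frac{\partial L}{\partial q_2} - \frac{d}{dt}\!\left(\frac{\partial L}{\partial \dot q_2}\right)\right] \frac{1}{\partial f/\partial q_2} \right\} \frac{\partial f}{\partial q_1}\,\delta q_1\,dt = 0. \tag{10.43}$$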



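This equation must be satisfied for all possible perturbations $\delta q_1(t)$, which implies that the
term enclosed in curly brackets is zero. Hence, we obtain

$$\frac{\partial L/\partial q_1 - (d/dt)(\partial L/\partial \dot q_1)}{\partial f/\partial q_1} = \frac{\partial L/\partial q_2 - (d/dt)(\partial L/\partial \dot q_2)}{\partial f/\partial q_2}. \tag{10.44}$$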



One obvious way in which we can solve this equation is to separately set both sides equal
to the same function of time, which we shall denote $-\lambda(t)$. It follows that the Lagrangian
equations of motion of the system can be written

$$\frac{d}{dt}\!\left(\frac{\partial L}{\partial \dot q_1}\right) - \frac{\partial L}{\partial q_1} - \lambda(t)\,\frac{\partial f}{\partial q_1} = 0, \tag{10.45}$$

$$\frac{d}{dt}\!\left(\frac{\partial L}{\partial \dot q_2}\right) - \frac{\partial L}{\partial q_2} - \lambda(t)\,\frac{\partial f}{\partial q_2} = 0. \tag{10.46}$$

In principle, the above two equations can be solved, together with the constraint equation 
(10.40), to give qi(t), q 2 (t), and the so-called Lagrange multiplier A(t). Equation (10.45) 
can be rewritten 

d / 3K\ 3K 3U _ . 3f 
dt\3qi/ 3qi 3qi 3qi 
Now, the generalized force conjugate to the generalized coordinate q^ is [see Equation (9.8)] 

Q.^=~^ (10.48) 

3qi 

By analogy, it seems clear from Equation (10.47) that the generalized constraint force [i.e., 
the generalized force responsible for maintaining the constraint (10.40)] conjugate to qi 
takes the form 

Qi = A(t) p-, (10.49) 

with a similar expression for the generalized constraint force conjugate to q 2 . 

Suppose, now, that we have a dynamical system described by T generalized coordinates 
q i5 for i = 1 , J 7 , which is subject to the holonomic constraint 

f(qi,q 2 ,---,q^,t) = 0. (10.50) 

A simple extension of above analysis yields following the Lagrangian equations of motion 
of the system, 

dt^qi/ 3qi lJ 3qi 

for i = 1 , T . As before, 

Qi = A(t) ^- (10.52) 
3qi 

is the generalized constraint force conjugate to q^. Finally, the generalization to multiple 
holonomic constraints is straightforward. 

Consider the following example. A cylinder of radius $a$ rolls without slipping down a
plane inclined at an angle $\theta$ to the horizontal. Let $x$ represent the downward displacement
of the center of mass of the cylinder parallel to the surface of the plane, and let $\phi$ represent
the angle of rotation of the cylinder about its symmetry axis. The fact that the cylinder is
rolling without slipping implies that $x$ and $\phi$ are interrelated via the well-known constraint

$$f(x, \phi) = x - a\,\phi = 0. \tag{10.53}$$

Figure 10.2: A cylinder rolling down an inclined plane.



The Lagrangian of the cylinder takes the form

$$L = \frac{1}{2}\,m\,\dot{x}^2 + \frac{1}{2}\,I\,\dot{\phi}^2 + m\,g\,x \sin\theta, \tag{10.54}$$

where $m$ is the cylinder's mass, $I$ its moment of inertia, and $g$ the acceleration due to
gravity.

Note that $\partial f/\partial x = 1$ and $\partial f/\partial \phi = -a$. Hence, Equation (10.51) yields the following
Lagrangian equations of motion:

$$m\,\ddot{x} - m\,g \sin\theta - \lambda = 0, \tag{10.55}$$

$$I\,\ddot{\phi} + \lambda\,a = 0. \tag{10.56}$$

Equations (10.53), (10.55), and (10.56) can be solved to give

$$\ddot{x} = \frac{g \sin\theta}{1 + I/m\,a^2}, \tag{10.57}$$

$$a\,\ddot{\phi} = \frac{g \sin\theta}{1 + I/m\,a^2}, \tag{10.58}$$

$$\lambda = -\frac{m\,g \sin\theta}{1 + m\,a^2/I}. \tag{10.59}$$

The generalized constraint force conjugate to $x$ is

$$\tilde{Q}_x = \lambda\,\frac{\partial f}{\partial x} = -\frac{m\,g \sin\theta}{1 + m\,a^2/I}. \tag{10.60}$$

This represents the frictional force acting parallel to the plane which impedes the downward
acceleration of the cylinder, causing it to be less than the standard value $g \sin\theta$.
The generalized constraint force conjugate to $\phi$ is

$$\tilde{Q}_\phi = \lambda\,\frac{\partial f}{\partial \phi} = \frac{m\,g\,a \sin\theta}{1 + m\,a^2/I}. \tag{10.61}$$

This represents the frictional torque acting on the cylinder which forces the cylinder to
rotate in such a manner that the constraint (10.53) is always satisfied.
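
As a numerical cross-check of the rolling-cylinder example, the sketch below solves Equations (10.55) and (10.56), together with the twice-differentiated constraint (10.53), as a linear system for the accelerations and the Lagrange multiplier. The function names and the sample parameter values are illustrative assumptions, not part of the text.

```python
import math


def solve3(A, b):
    """Solve a 3x3 linear system A x = b by Gaussian elimination with partial pivoting."""
    n = 3
    aug = [row[:] + [rhs] for row, rhs in zip(A, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for r in reversed(range(n)):
        s = aug[r][n] - sum(aug[r][c] * x[c] for c in range(r + 1, n))
        x[r] = s / aug[r][r]
    return x


def rolling_cylinder(m, I, a, g, theta):
    """Return (x_ddot, phi_ddot, lam) for a cylinder rolling without slipping.

    Rows: Eq. (10.55), Eq. (10.56), and the second time derivative of Eq. (10.53).
    """
    A = [
        [m, 0.0, -1.0],   # m x_ddot - lam = m g sin(theta)
        [0.0, I, a],      # I phi_ddot + a lam = 0
        [1.0, -a, 0.0],   # x_ddot - a phi_ddot = 0
    ]
    b = [m * g * math.sin(theta), 0.0, 0.0]
    return solve3(A, b)


# Hypothetical sample values: a uniform solid cylinder, for which I = m a^2 / 2.
m, a, g, theta = 2.0, 0.5, 9.81, math.radians(30.0)
I = 0.5 * m * a * a
x_ddot, phi_ddot, lam = rolling_cylinder(m, I, a, g, theta)
```

For these sample values the solution can be compared directly with the closed forms (10.57) and (10.59); the multiplier comes out negative, consistent with a frictional force directed up the plane.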

Consider a second example. A bead of mass $m$ slides without friction on a vertical
circular hoop of radius $a$. Let $r$ be the radial coordinate of the bead, and let $\theta$ be its
angular coordinate, with the lowest point on the hoop corresponding to $\theta = 0$. Both
coordinates are measured relative to the center of the hoop. Now, the bead is constrained
to slide along the wire, which implies that

$$f(r, \theta) = r - a = 0. \tag{10.62}$$

Note that $\partial f/\partial r = 1$ and $\partial f/\partial \theta = 0$. The Lagrangian of the system takes the form

$$L = \frac{1}{2}\,m\,(\dot{r}^2 + r^2\,\dot{\theta}^2) + m\,g\,r \cos\theta. \tag{10.63}$$

Hence, according to Equation (10.51), the Lagrangian equations of motion of the system
are written

$$m\,\ddot{r} - m\,r\,\dot{\theta}^2 - m\,g \cos\theta - \lambda = 0, \tag{10.64}$$

$$m\,r^2\,\ddot{\theta} + m\,g\,r \sin\theta = 0. \tag{10.65}$$

The second of these equations can be integrated (by multiplying by $\dot{\theta}$), subject to the
constraint (10.62), to give

$$\dot{\theta}^2 = \frac{2\,g}{a} \cos\theta + c, \tag{10.66}$$

where $c$ is a constant. Let $v$ be the tangential velocity of the bead at the bottom of the
hoop (i.e., at $\theta = 0$). It follows that

$$\dot{\theta}^2 = \frac{2\,g}{a}\,(\cos\theta - 1) + \frac{v^2}{a^2}. \tag{10.67}$$

Equations (10.62), (10.64), and (10.67) can be combined to give

$$\lambda = -m \left( 3\,g \cos\theta - 2\,g + \frac{v^2}{a} \right). \tag{10.68}$$

Finally, the constraint force conjugate to $r$ is given by

$$\tilde{Q}_r = \lambda\,\frac{\partial f}{\partial r} = -m \left( 3\,g \cos\theta - 2\,g + \frac{v^2}{a} \right). \tag{10.69}$$

This represents the radial reaction exerted on the bead by the hoop. Of course, there is no
constraint force conjugate to $\theta$ (since $\partial f/\partial \theta = 0$) because the bead slides without friction.
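
The closed-form multiplier (10.68) can be checked against a direct numerical integration of Equation (10.65). The sketch below is a minimal illustration under stated assumptions: the function names, the fourth-order Runge-Kutta integrator, and the sample values are choices made here, not taken from the text.

```python
import math


def lambda_closed_form(m, g, a, v, theta):
    """Lagrange multiplier from Eq. (10.68)."""
    return -m * (3.0 * g * math.cos(theta) - 2.0 * g + v * v / a)


def lambda_from_motion(m, g, a, theta, omega):
    """Lagrange multiplier from Eq. (10.64) with r = a and r_ddot = 0."""
    return -m * (a * omega * omega + g * math.cos(theta))


def integrate_bead(g, a, v, t_end, dt=1.0e-3):
    """Integrate theta_ddot = -(g/a) sin(theta), starting at the bottom of the hoop."""
    theta, omega = 0.0, v / a

    def acc(th):
        return -(g / a) * math.sin(th)

    for _ in range(int(round(t_end / dt))):
        k1t, k1o = omega, acc(theta)
        k2t, k2o = omega + 0.5 * dt * k1o, acc(theta + 0.5 * dt * k1t)
        k3t, k3o = omega + 0.5 * dt * k2o, acc(theta + 0.5 * dt * k2t)
        k4t, k4o = omega + dt * k3o, acc(theta + dt * k3t)
        theta += dt * (k1t + 2.0 * k2t + 2.0 * k3t + k4t) / 6.0
        omega += dt * (k1o + 2.0 * k2o + 2.0 * k3o + k4o) / 6.0
    return theta, omega


# Hypothetical sample values (SI units).
m, g, a, v = 0.1, 9.81, 0.5, 2.0
theta, omega = integrate_bead(g, a, v, t_end=0.3)
lam_numeric = lambda_from_motion(m, g, a, theta, omega)
lam_formula = lambda_closed_form(m, g, a, v, theta)
```

The two values of the multiplier should agree to within the integration error, since Equation (10.67) is just the energy integral of the motion being integrated.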
10.7 Hamilton's Equations

Consider a dynamical system with $\mathcal{F}$ degrees of freedom which is described by the
generalized coordinates $q_i$, for $i = 1, \mathcal{F}$. Suppose that neither the kinetic energy, $K$, nor the
potential energy, $U$, depend explicitly on the time, $t$. Now, in conventional dynamical
systems, the potential energy is generally independent of the $\dot{q}_i$, whereas the kinetic energy
takes the form of a homogeneous quadratic function of the $\dot{q}_i$. In other words,

$$K = \sum_{i,j=1,\mathcal{F}} m_{ij}\,\dot{q}_i\,\dot{q}_j, \tag{10.70}$$

where the $m_{ij}$ depend on the $q_i$, but not on the $\dot{q}_i$. It is easily demonstrated from the above
equation that

$$\sum_{i=1,\mathcal{F}} \dot{q}_i\,\frac{\partial K}{\partial \dot{q}_i} = 2\,K. \tag{10.71}$$

Recall, from Section 9.8, that the generalized momentum conjugate to the $i$th generalized
coordinate is defined

$$p_i = \frac{\partial L}{\partial \dot{q}_i} = \frac{\partial K}{\partial \dot{q}_i}, \tag{10.72}$$

where $L = K - U$ is the Lagrangian of the system, and we have made use of the fact that
$U$ is independent of the $\dot{q}_i$. Consider the function

$$H = \sum_{i=1,\mathcal{F}} \dot{q}_i\,p_i - L = \sum_{i=1,\mathcal{F}} \dot{q}_i\,p_i - K + U. \tag{10.73}$$

If all of the conditions discussed above are satisfied then Equations (10.71) and (10.72)
yield

$$H = K + U. \tag{10.74}$$

In other words, the function $H$ is equal to the total energy of the system.
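
The identity (10.71) is stated without proof. A short worked derivation, assuming only the quadratic form of Equation (10.70) with coefficients $m_{ij}$ that do not depend on the generalized velocities, runs as follows. Differentiating with respect to a particular velocity $\dot{q}_k$ picks out the terms in which $k$ appears as either index,

$$\frac{\partial K}{\partial \dot{q}_k} = \sum_{j=1,\mathcal{F}} m_{kj}\,\dot{q}_j + \sum_{i=1,\mathcal{F}} m_{ik}\,\dot{q}_i,$$

so that multiplying by $\dot{q}_k$ and summing over $k$ gives

$$\sum_{k=1,\mathcal{F}} \dot{q}_k\,\frac{\partial K}{\partial \dot{q}_k} = \sum_{k,j=1,\mathcal{F}} m_{kj}\,\dot{q}_k\,\dot{q}_j + \sum_{i,k=1,\mathcal{F}} m_{ik}\,\dot{q}_i\,\dot{q}_k = 2\,K.$$

Substituting this result and Equation (10.72) into Equation (10.73) gives $H = 2\,K - K + U = K + U$, which is Equation (10.74).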
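Consider the variation of the function $H$. We have

$$\delta H = \sum_{i=1,\mathcal{F}} \left( \delta\dot{q}_i\,p_i + \dot{q}_i\,\delta p_i - \frac{\partial L}{\partial \dot{q}_i}\,\delta\dot{q}_i - \frac{\partial L}{\partial q_i}\,\delta q_i \right). \tag{10.75}$$

The first and third terms in the bracket cancel, because $p_i = \partial L/\partial \dot{q}_i$. Furthermore, since
Lagrange's equation can be written $\dot{p}_i = \partial L/\partial q_i$ (see Section 9.8), we obtain

$$\delta H = \sum_{i=1,\mathcal{F}} \left( \dot{q}_i\,\delta p_i - \dot{p}_i\,\delta q_i \right). \tag{10.76}$$

Suppose, now, that we can express the total energy of the system, $H$, solely as a function of
the $q_i$ and the $p_i$, with no explicit dependence on the $\dot{q}_i$. In other words, suppose that we
can write $H = H(q_i, p_i)$. When the energy is written in this fashion it is generally termed
the Hamiltonian of the system. The variation of the Hamiltonian function takes the form

$$\delta H = \sum_{i=1,\mathcal{F}} \left( \frac{\partial H}{\partial p_i}\,\delta p_i + \frac{\partial H}{\partial q_i}\,\delta q_i \right). \tag{10.77}$$

A comparison of the previous two equations yields

$$\dot{q}_i = \frac{\partial H}{\partial p_i}, \tag{10.78}$$

$$\dot{p}_i = -\frac{\partial H}{\partial q_i}, \tag{10.79}$$

for $i = 1, \mathcal{F}$. These $2\mathcal{F}$ first-order differential equations are known as Hamilton's equations.
Hamilton's equations are often a useful alternative to Lagrange's equations, which take the
form of $\mathcal{F}$ second-order differential equations.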

Consider a one-dimensional harmonic oscillator. The kinetic and potential energies of
the system are written $K = (1/2)\,m\,\dot{x}^2$ and $U = (1/2)\,k\,x^2$, where $x$ is the displacement, $m$
the mass, and $k > 0$. The generalized momentum conjugate to $x$ is

$$p = \frac{\partial K}{\partial \dot{x}} = m\,\dot{x}. \tag{10.80}$$

Hence, we can write

$$K = \frac{1}{2}\,\frac{p^2}{m}. \tag{10.81}$$

So, the Hamiltonian of the system takes the form

$$H = K + U = \frac{1}{2}\,\frac{p^2}{m} + \frac{1}{2}\,k\,x^2. \tag{10.82}$$

Thus, Hamilton's equations, (10.78) and (10.79), yield

$$\dot{x} = \frac{\partial H}{\partial p} = \frac{p}{m}, \tag{10.83}$$

$$\dot{p} = -\frac{\partial H}{\partial x} = -k\,x. \tag{10.84}$$

Of course, the first equation is just a restatement of Equation (10.80), whereas the second
is Newton's second law of motion for the system.
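
Because Hamilton's equations are first order, they can be stepped forward in time directly. The sketch below integrates Equations (10.83) and (10.84) for the harmonic oscillator using a leapfrog (kick-drift-kick) scheme. The choice of integrator, the function names, and the sample values are assumptions made for illustration.

```python
import math


def hamiltonian(x, p, m, k):
    """Total energy, Eq. (10.82)."""
    return 0.5 * p * p / m + 0.5 * k * x * x


def leapfrog_step(x, p, m, k, dt):
    """Advance (x, p) by dt using p_dot = -k x and x_dot = p / m."""
    p -= 0.5 * dt * k * x   # half kick, Eq. (10.84)
    x += dt * p / m         # full drift, Eq. (10.83)
    p -= 0.5 * dt * k * x   # half kick, Eq. (10.84)
    return x, p


def evolve(x, p, m, k, t_end, n_steps):
    dt = t_end / n_steps
    for _ in range(n_steps):
        x, p = leapfrog_step(x, p, m, k, dt)
    return x, p


# Hypothetical sample values: m = 1, k = 4, released from rest at x = 1.
m, k = 1.0, 4.0
period = 2.0 * math.pi * math.sqrt(m / k)
x0, p0 = 1.0, 0.0
x1, p1 = evolve(x0, p0, m, k, period, 20000)
```

After one period, $2\pi\sqrt{m/k}$, the state should return close to its starting point, and the value of the Hamiltonian should be essentially unchanged.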

Consider a particle of mass $m$ moving in the central potential $U(r)$. In this case,

$$K = \frac{1}{2}\,m\,(\dot{r}^2 + r^2\,\dot{\theta}^2), \tag{10.85}$$

where $r$, $\theta$ are polar coordinates. The generalized momenta conjugate to $r$ and $\theta$ are

$$p_r = \frac{\partial K}{\partial \dot{r}} = m\,\dot{r}, \tag{10.86}$$

$$p_\theta = \frac{\partial K}{\partial \dot{\theta}} = m\,r^2\,\dot{\theta}, \tag{10.87}$$

respectively. Hence, we can write

$$K = \frac{1}{2\,m} \left( p_r^2 + \frac{p_\theta^2}{r^2} \right). \tag{10.88}$$

Thus, the Hamiltonian of the system takes the form

$$H = \frac{1}{2\,m} \left( p_r^2 + \frac{p_\theta^2}{r^2} \right) + U(r). \tag{10.89}$$

In this case, Hamilton's equations yield

$$\dot{r} = \frac{\partial H}{\partial p_r} = \frac{p_r}{m}, \tag{10.90}$$

$$\dot{\theta} = \frac{\partial H}{\partial p_\theta} = \frac{p_\theta}{m\,r^2}, \tag{10.91}$$

which are just restatements of Equations (10.86) and (10.87), respectively, as well as

$$\dot{p}_r = -\frac{\partial H}{\partial r} = \frac{p_\theta^2}{m\,r^3} - \frac{\partial U}{\partial r}, \tag{10.92}$$

$$\dot{p}_\theta = -\frac{\partial H}{\partial \theta} = 0. \tag{10.93}$$

The last equation implies that

$$\frac{p_\theta}{m} = r^2\,\dot{\theta} = h, \tag{10.94}$$

where $h$ is a constant. This can be combined with Equation (10.92) to give

$$\frac{\dot{p}_r}{m} = \ddot{r} = \frac{h^2}{r^3} - \frac{\partial V}{\partial r}, \tag{10.95}$$

where $V = U/m$. Of course, Equations (10.94) and (10.95) are the conventional equations
of motion for a particle moving in a central potential (see Chapter 5).
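
Equations (10.90)-(10.93) can be integrated numerically for any central potential. The sketch below does so with a fourth-order Runge-Kutta step. The attractive inverse-distance potential $U = -\kappa/r$, the constant $\kappa$, and all function names are assumptions introduced here for illustration.

```python
import math


def rhs(state, m, dU_dr):
    """Right-hand sides of Hamilton's equations (10.90)-(10.93)."""
    r, theta, p_r, p_theta = state
    return (
        p_r / m,
        p_theta / (m * r * r),
        p_theta ** 2 / (m * r ** 3) - dU_dr(r),
        0.0,
    )


def rk4_step(state, dt, m, dU_dr):
    def shifted(s, k, h):
        return tuple(si + h * ki for si, ki in zip(s, k))

    k1 = rhs(state, m, dU_dr)
    k2 = rhs(shifted(state, k1, 0.5 * dt), m, dU_dr)
    k3 = rhs(shifted(state, k2, 0.5 * dt), m, dU_dr)
    k4 = rhs(shifted(state, k3, dt), m, dU_dr)
    return tuple(
        s + dt * (a + 2.0 * b + 2.0 * c + d) / 6.0
        for s, a, b, c, d in zip(state, k1, k2, k3, k4)
    )


def energy(state, m, U):
    """Hamiltonian, Eq. (10.89)."""
    r, _, p_r, p_theta = state
    return (p_r ** 2 + p_theta ** 2 / r ** 2) / (2.0 * m) + U(r)


# Hypothetical potential U = -kappa / r with m = kappa = 1.
m, kappa = 1.0, 1.0
U = lambda r: -kappa / r
dU_dr = lambda r: kappa / (r * r)

# Circular orbit: p_theta^2 = m kappa r0 makes p_r_dot vanish at r = r0.
circular = (1.0, 0.0, 0.0, math.sqrt(m * kappa * 1.0))
# Non-circular bound orbit with a larger angular momentum.
eccentric = (1.0, 0.0, 0.0, 1.2)
e_start = energy(eccentric, m, U)
for _ in range(1000):
    circular = rk4_step(circular, 0.01, m, dU_dr)
    eccentric = rk4_step(eccentric, 0.01, m, dU_dr)
e_end = energy(eccentric, m, U)
```

On the circular orbit the radius should stay fixed while the angle advances uniformly; on the non-circular orbit, the angular momentum is constant by Equation (10.93), and the energy should be conserved to within the integration error.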



10.8 Exercises 



10.1. A particle of mass m is placed at the top of a smooth vertical hoop of radius a. Calculate the 
reaction of the hoop on the particle as it slides down the hoop by means of the method of 
Lagrange multipliers. Find the height at which the particle falls off the hoop. 
10.2. A uniform disk of mass m and radius a has a light string wrapped around its circumference
with one end of the string attached to a fixed support. The disk is allowed to fall under 
gravity, unwinding the string as it falls. Solve the problem using the method of Lagrange 
multipliers. What is the tension in the string? 

10.3. Consider two particles of masses $m_1$ and $m_2$. Let $m_1$ be constrained to move on a circle of
radius $a$ in the $z = 0$ plane, centered at $x = y = 0$. (Here, $z$ measures vertical height.) Let $m_2$
be constrained to move on a circle of radius $a$ in the $z = c$ plane, centered on $x = y = 0$. A
light spring of spring constant $k$ and unstretched length $c$ is attached between the particles.
Find the Lagrangian of the system. Solve the problem using Lagrange multipliers and give a
physical interpretation for each multiplier.

10.4. Find the Hamiltonian of a particle of mass $m$ constrained to move under gravity on the
inside of a sphere of radius $a$. Use the standard spherical polar coordinates $\theta$ and $\phi$ as your
generalized coordinates, where the axis of the coordinates points vertically downward. Find
Hamilton's equations of motion for the system.

10.5. A particle of mass $m$ is subject to a central attractive force given by

$$\mathbf{f} = -\frac{k\,e^{-\beta t}}{r^2}\,\mathbf{e}_r,$$

where $k$ and $\beta$ are positive constants. Find the Hamiltonian of the particle. Compare the
Hamiltonian to the total energy of the particle. Is the energy of the particle conserved?
11 Coupled Oscillations



11.1 Introduction 

This chapter examines the motion of a many degree of freedom dynamical system which 
is perturbed from some equilibrium state. 

11.2 Equilibrium State 

Consider an $\mathcal{F}$ degree of freedom dynamical system described by the generalized
coordinates $q_i$, for $i = 1, \mathcal{F}$. Suppose that the kinetic energy $K$ and the potential energy $U$ are
not explicit functions of time. This implies that the system in question is isolated: i.e., it
is not subject to any external forces or time-varying constraints. In virtually all dynamical
systems of interest, the kinetic energy can be expressed as a quadratic form: i.e.,

$$K = \frac{1}{2} \sum_{i,j=1,\mathcal{F}} m_{ij}(q_1, q_2, \cdots, q_{\mathcal{F}})\,\dot{q}_i\,\dot{q}_j. \tag{11.1}$$

Without loss of generality, we can specify that the weights $m_{ij}$ in the above form are
invariant under interchange of the indices $i$ and $j$: i.e.,

$$m_{ij} = m_{ji}. \tag{11.2}$$

Finally, the potential energy is written $U = U(q_1, q_2, \cdots, q_{\mathcal{F}})$.

Suppose that $q_i = q_{i0}$, for $i = 1, \mathcal{F}$, corresponds to an equilibrium state of the system.
It follows that $q_i = q_{i0}$ and $\dot{q}_i = \ddot{q}_i = 0$, for $i = 1, \mathcal{F}$, should be a possible solution of the
equations of motion. Now, Lagrange's equations of motion for the system take the form
[see Equation (9.22)]

$$\frac{d}{dt}\left( \frac{\partial K}{\partial \dot{q}_i} \right) - \frac{\partial K}{\partial q_i} + \frac{\partial U}{\partial q_i} = 0, \tag{11.3}$$

for $i = 1, \mathcal{F}$. Here, we have made use of the definition $L = K - U$, and the fact that $U$ is
independent of the $\dot{q}_i$. Now, it is clear, from an examination of Equation (11.1), that every
component making up the first two terms in the above equation contains either a $\dot{q}_j$ or a
$\ddot{q}_j$, for some $j$. But, we can set all of the generalized velocities and accelerations to zero
in an equilibrium state of the system. Hence, the first two terms in the above equation are
zero, and the condition for equilibrium reduces to

$$Q_{i0} = -\frac{\partial U(q_{10}, q_{20}, \cdots, q_{\mathcal{F}0})}{\partial q_i} = 0, \tag{11.4}$$

for $i = 1, \mathcal{F}$. In other words, $q_i = q_{i0}$, for $i = 1, \mathcal{F}$, is an equilibrium state provided that all
of the generalized forces, $Q_i$ [see Equation (9.8)], evaluated at $q_i = q_{i0}$, are zero. Let us
suppose that this is the case.
11.3 Stability Equations

It is evident that if our system is initialized in some equilibrium state, with all of the $\dot{q}_i$ set
to zero, then it will remain in this state for ever. But what happens if the system is slightly
perturbed from the equilibrium state?
Let

$$q_i = q_{i0} + \delta q_i, \tag{11.5}$$

for $i = 1, \mathcal{F}$, where the $\delta q_i$ are small. To lowest order in $\delta\dot{q}_i$, the kinetic energy (11.1) can
be written

$$K \simeq \frac{1}{2} \sum_{i,j=1,\mathcal{F}} M_{ij}\,\delta\dot{q}_i\,\delta\dot{q}_j, \tag{11.6}$$

where

$$M_{ij} = m_{ij}(q_{10}, q_{20}, \cdots, q_{\mathcal{F}0}), \tag{11.7}$$

and

$$M_{ij} = M_{ji}. \tag{11.8}$$

Note that the weights $M_{ij}$ in the quadratic form (11.6) are now constants.

Taylor expanding the potential energy function about the equilibrium state, up to
second-order in the $\delta q_i$, we obtain

$$U \simeq U_0 - \sum_{i=1,\mathcal{F}} Q_{i0}\,\delta q_i - \frac{1}{2} \sum_{i,j=1,\mathcal{F}} G_{ij}\,\delta q_i\,\delta q_j, \tag{11.9}$$

where $U_0 = U(q_{10}, q_{20}, \cdots, q_{\mathcal{F}0})$, the $Q_{i0}$ are specified in Equation (11.4), and

$$G_{ij} = -\frac{\partial^2 U(q_{10}, q_{20}, \cdots, q_{\mathcal{F}0})}{\partial q_i\,\partial q_j}. \tag{11.10}$$

Now, we can set $U_0$ to zero without loss of generality. Moreover, according to
Equation (11.4), the $Q_{i0}$ are all zero. Hence, the expression (11.9) reduces to

$$U \simeq -\frac{1}{2} \sum_{i,j=1,\mathcal{F}} G_{ij}\,\delta q_i\,\delta q_j. \tag{11.11}$$

Note that, since $\partial^2 U/\partial q_i\,\partial q_j = \partial^2 U/\partial q_j\,\partial q_i$, the constant weights $G_{ij}$ in the above quadratic
form are invariant under interchange of the indices $i$ and $j$: i.e.,

$$G_{ij} = G_{ji}. \tag{11.12}$$

With $K$ and $U$ specified by the quadratic forms (11.6) and (11.11), respectively,
Lagrange's equations of motion (11.3) reduce to

$$\sum_{j=1,\mathcal{F}} \left( M_{ij}\,\delta\ddot{q}_j - G_{ij}\,\delta q_j \right) = 0, \tag{11.13}$$

for $i = 1, \mathcal{F}$. Note that the above coupled differential equations are linear in the $\delta q_i$.
It follows that the solutions are superposable. Let us search for solutions of the above
equations in which all of the perturbed coordinates $\delta q_i$ have a common time variation of
the form

$$\delta q_i(t) = \delta\hat{q}_i\,e^{\gamma t}, \tag{11.14}$$

for $i = 1, \mathcal{F}$. Now, Equations (11.13) are a set of $\mathcal{F}$ second-order differential equations.
Hence, the most general solution contains $2\mathcal{F}$ arbitrary constants of integration. Thus, if
we can find sufficient independent solutions of the form (11.14) to Equations (11.13) that
the superposition of these solutions contains $2\mathcal{F}$ arbitrary constants then we can be sure
that we have found the most general solution. Equations (11.13) and (11.14) yield

$$\sum_{j=1,\mathcal{F}} \left( G_{ij} - \gamma^2 M_{ij} \right) \delta\hat{q}_j = 0, \tag{11.15}$$

which can be written more succinctly as a matrix equation:

$$(\mathbf{G} - \gamma^2\,\mathbf{M})\,\delta\hat{\mathbf{q}} = \mathbf{0}. \tag{11.16}$$

Here, $\mathbf{G}$ is the real [see Equation (11.10)] symmetric [see Equation (11.12)] $\mathcal{F} \times \mathcal{F}$
matrix of the $G_{ij}$ values. Furthermore, $\mathbf{M}$ is the real [see Equation (11.1)] symmetric [see
Equation (11.8)] $\mathcal{F} \times \mathcal{F}$ matrix of the $M_{ij}$ values. Finally, $\delta\hat{\mathbf{q}}$ is the $1 \times \mathcal{F}$ vector of the $\delta\hat{q}_i$
values, and $\mathbf{0}$ is a null vector.



11.4 More Matrix Eigenvalue Theory 

Equation (11.16) takes the form of a matrix eigenvalue equation:

$$(\mathbf{G} - \lambda\,\mathbf{M})\,\mathbf{x} = \mathbf{0}. \tag{11.17}$$

Here, $\mathbf{G}$ and $\mathbf{M}$ are both real symmetric matrices, whereas $\lambda$ is termed the eigenvalue, and
$\mathbf{x}$ the associated eigenvector. The above matrix equation is essentially a set of $\mathcal{F}$
homogeneous simultaneous algebraic equations for the components of $\mathbf{x}$. As is well-known, a
necessary condition for such a set of equations to possess a non-trivial solution is that the
determinant of the matrix must be zero: i.e.,

$$|\mathbf{G} - \lambda\,\mathbf{M}| = 0. \tag{11.18}$$

The above formula reduces to an $\mathcal{F}$th-order polynomial equation for $\lambda$. Hence, we conclude
that Equation (11.17) is satisfied by $\mathcal{F}$ eigenvalues, and $\mathcal{F}$ associated eigenvectors.

We can easily demonstrate that the eigenvalues are all real. Suppose that $\lambda_k$ and $\mathbf{x}_k$ are
the $k$th eigenvalue and eigenvector, respectively. Then we have

$$\mathbf{G}\,\mathbf{x}_k = \lambda_k\,\mathbf{M}\,\mathbf{x}_k. \tag{11.19}$$

Taking the transpose and complex conjugate of the above equation, and right multiplying
by $\mathbf{x}_k$, we obtain

$$\mathbf{x}_k^{*T}\,\mathbf{G}^{*T}\,\mathbf{x}_k = \lambda_k^*\,\mathbf{x}_k^{*T}\,\mathbf{M}^{*T}\,\mathbf{x}_k. \tag{11.20}$$

Here, $^T$ denotes a transpose, and $^*$ a complex conjugate. However, since $\mathbf{G}$ and $\mathbf{M}$ are
both real symmetric matrices, it follows that $\mathbf{G}^{*T} = \mathbf{G}$ and $\mathbf{M}^{*T} = \mathbf{M}$. Hence,

$$\mathbf{x}_k^{*T}\,\mathbf{G}\,\mathbf{x}_k = \lambda_k^*\,\mathbf{x}_k^{*T}\,\mathbf{M}\,\mathbf{x}_k. \tag{11.21}$$

Next, left multiplying Equation (11.19) by $\mathbf{x}_k^{*T}$, we obtain

$$\mathbf{x}_k^{*T}\,\mathbf{G}\,\mathbf{x}_k = \lambda_k\,\mathbf{x}_k^{*T}\,\mathbf{M}\,\mathbf{x}_k. \tag{11.22}$$

Taking the difference between the above two expressions, we get

$$(\lambda_k^* - \lambda_k)\,\mathbf{x}_k^{*T}\,\mathbf{M}\,\mathbf{x}_k = 0. \tag{11.23}$$

Since $\mathbf{x}_k^{*T}\,\mathbf{M}\,\mathbf{x}_k$ is not generally zero, except in the trivial case where $\mathbf{x}_k$ is a null vector, we
conclude that $\lambda_k^* = \lambda_k$ for all $k$. In other words, the eigenvalues are all real. It immediately
follows that the eigenvectors can also be chosen to be all real.

Consider two distinct eigenvalues, $\lambda_k$ and $\lambda_l$, with the associated eigenvectors $\mathbf{x}_k$ and
$\mathbf{x}_l$, respectively. We have

$$\mathbf{G}\,\mathbf{x}_k = \lambda_k\,\mathbf{M}\,\mathbf{x}_k, \tag{11.24}$$

$$\mathbf{G}\,\mathbf{x}_l = \lambda_l\,\mathbf{M}\,\mathbf{x}_l. \tag{11.25}$$

Right multiplying the transpose of Equation (11.24) by $\mathbf{x}_l$, and left multiplying Equation
(11.25) by $\mathbf{x}_k^T$, we obtain

$$\mathbf{x}_k^T\,\mathbf{G}\,\mathbf{x}_l = \lambda_k\,\mathbf{x}_k^T\,\mathbf{M}\,\mathbf{x}_l, \tag{11.26}$$

$$\mathbf{x}_k^T\,\mathbf{G}\,\mathbf{x}_l = \lambda_l\,\mathbf{x}_k^T\,\mathbf{M}\,\mathbf{x}_l. \tag{11.27}$$

Taking the difference between the above two expressions, we get

$$(\lambda_k - \lambda_l)\,\mathbf{x}_k^T\,\mathbf{M}\,\mathbf{x}_l = 0. \tag{11.28}$$

Hence, we conclude that

$$\mathbf{x}_k^T\,\mathbf{M}\,\mathbf{x}_l = 0, \tag{11.29}$$

provided $\lambda_k \neq \lambda_l$. In other words, two eigenvectors corresponding to two different
eigenvalues are "orthogonal" to one another (in the sense specified in the above equation).
Moreover, it is easily demonstrated that different eigenvectors corresponding to the same
eigenvalue can be chosen in such a manner that they are also orthogonal to one another
(see Section 8.5). Thus, we conclude that all of the eigenvectors are mutually orthogonal.
Since Equation (11.17) only specifies the directions, and not the lengths, of the
eigenvectors, we are free to normalize our eigenvectors such that

$$\mathbf{x}_k^T\,\mathbf{M}\,\mathbf{x}_l = \delta_{kl}, \tag{11.30}$$

where $\delta_{kl} = 1$ when $k = l$, and $\delta_{kl} = 0$ otherwise. Note, finally, that since the $\mathbf{x}_k$, for
$k = 1, \mathcal{F}$, are mutually orthogonal, they are independent (i.e., one eigenvector cannot
be expressed as a linear combination of the others), and completely span $\mathcal{F}$-dimensional
vector space.
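
The results of this section can be illustrated for the smallest non-trivial case. The sketch below solves the eigenvalue equation (11.17) for real symmetric $2 \times 2$ matrices by finding the roots of the determinant condition (11.18), and normalizes the eigenvectors according to Equation (11.30). It assumes that the mass matrix is positive definite and that the two roots are distinct; the function names and the sample matrices are hypothetical.

```python
import math


def quad(x, A, y):
    """Bilinear form x^T A y for 2-component vectors and a 2x2 matrix."""
    return sum(x[i] * A[i][j] * y[j] for i in range(2) for j in range(2))


def generalized_eig_2x2(G, M):
    """Return [(lam, x), ...] solving (G - lam M) x = 0 for symmetric 2x2 G and M.

    Assumes M is positive definite; repeated roots are not treated specially.
    """
    g11, g12, g22 = G[0][0], G[0][1], G[1][1]
    m11, m12, m22 = M[0][0], M[0][1], M[1][1]
    # det(G - lam M) = a lam^2 + b lam + c
    a = m11 * m22 - m12 * m12
    b = -(g11 * m22 + g22 * m11 - 2.0 * g12 * m12)
    c = g11 * g22 - g12 * g12
    disc = max(b * b - 4.0 * a * c, 0.0)  # guard against tiny negative round-off
    roots = [(-b + math.sqrt(disc)) / (2.0 * a), (-b - math.sqrt(disc)) / (2.0 * a)]
    pairs = []
    for lam in roots:
        a11, a12, a22 = g11 - lam * m11, g12 - lam * m12, g22 - lam * m22
        # Null vector of the singular matrix, built from whichever row is larger.
        if abs(a11) + abs(a12) >= abs(a12) + abs(a22):
            x = [-a12, a11]
        else:
            x = [a22, -a12]
        if x[0] == 0.0 and x[1] == 0.0:
            x = [1.0, 0.0]
        norm = math.sqrt(quad(x, M, x))  # enforce x^T M x = 1, Eq. (11.30)
        pairs.append((lam, [xi / norm for xi in x]))
    return pairs


# Hypothetical symmetric test matrices.
G = [[-3.0, 1.0], [1.0, -2.0]]
M = [[2.0, 0.5], [0.5, 1.0]]
(lam_a, x_a), (lam_b, x_b) = generalized_eig_2x2(G, M)
```

The eigenvalues come out real, and the two eigenvectors satisfy the orthonormality condition (11.30) with respect to the mass matrix rather than the ordinary dot product.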

11.5 Normal Modes 

It follows from Equations (11.14) and (11.15), plus the mathematical results contained in
the previous section, that the most general solution to Equation (11.13) can be written

$$\delta\mathbf{q}(t) = \sum_{k=1,\mathcal{F}} \delta\mathbf{q}_k(t), \tag{11.31}$$

where

$$\delta\mathbf{q}_k(t) = \left( \alpha_k\,e^{+\sqrt{\lambda_k}\,t} + \beta_k\,e^{-\sqrt{\lambda_k}\,t} \right) \mathbf{x}_k. \tag{11.32}$$

Here, the $\lambda_k$ and the $\mathbf{x}_k$ are the eigenvalues and eigenvectors obtained by solving
Equation (11.17). Moreover, the $\alpha_k$ and $\beta_k$ are arbitrary constants. Finally, we have made use
of the fact that the two roots of $\gamma^2 = \lambda_k$ are $\gamma = \pm\sqrt{\lambda_k}$.

According to Equation (11.31), the most general perturbed motion of the system
consists of a linear combination of $\mathcal{F}$ different modes. These modes are generally termed
normal modes, since they are mutually orthogonal (because the $\mathbf{x}_k$ are mutually
orthogonal). Furthermore, it follows from the independence of the $\mathbf{x}_k$ that the normal modes
are also independent (i.e., one mode cannot be expressed as a linear combination of the
others). The $k$th normal mode has a specific pattern of motion which is specified by the
$k$th eigenvector, $\mathbf{x}_k$. Moreover, the $k$th mode has a specific time variation which is
determined by the associated eigenvalue, $\lambda_k$. Recall that $\lambda_k$ is real. Hence, there are only two
possibilities. Either $\lambda_k$ is positive, in which case we can write

$$\delta\mathbf{q}_k(t) = \left( \alpha_k\,e^{+\gamma_k t} + \beta_k\,e^{-\gamma_k t} \right) \mathbf{x}_k, \tag{11.33}$$

where $\lambda_k = \gamma_k^2$, or $\lambda_k$ is negative, in which case we can write

$$\delta\mathbf{q}_k(t) = \left( \alpha_k\,e^{+i\,\omega_k t} + \beta_k\,e^{-i\,\omega_k t} \right) \mathbf{x}_k, \tag{11.34}$$

where $\lambda_k = -\omega_k^2$. In other words, if $\lambda_k$ is positive then the $k$th normal mode grows or
decays secularly in time, whereas if $\lambda_k$ is negative then the $k$th normal mode oscillates in
time. Obviously, if the system possesses one or more normal modes which grow secularly
in time then the equilibrium about which we originally expanded the equations of motion
must be an unstable equilibrium. On the other hand, if all of the normal modes oscillate in
time then the equilibrium is stable. Thus, we conclude that whilst Equation (11.4) is the
condition for the existence of an equilibrium state in a many degree of freedom system,
the condition for the equilibrium to be stable is that all of the eigenvalues of the stability
equation (11.17) must be negative.
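The arbitrary constants $\alpha_k$ and $\beta_k$ appearing in expression (11.32) are determined
from the initial conditions. Thus, if $\delta\mathbf{q}_{(0)} = \delta\mathbf{q}(t = 0)$ and $\delta\dot{\mathbf{q}}_{(0)} = \delta\dot{\mathbf{q}}(t = 0)$ then it is easily
demonstrated from Equations (11.30)-(11.32) that

$$\mathbf{x}_k^T\,\mathbf{M}\,\delta\mathbf{q}_{(0)} = \alpha_k + \beta_k, \tag{11.35}$$

and

$$\mathbf{x}_k^T\,\mathbf{M}\,\delta\dot{\mathbf{q}}_{(0)} = \sqrt{\lambda_k}\,(\alpha_k - \beta_k). \tag{11.36}$$

Hence,

$$\alpha_k = \frac{1}{2} \left[ \mathbf{x}_k^T\,\mathbf{M}\,\delta\mathbf{q}_{(0)} + \frac{\mathbf{x}_k^T\,\mathbf{M}\,\delta\dot{\mathbf{q}}_{(0)}}{\sqrt{\lambda_k}} \right], \tag{11.37}$$

$$\beta_k = \frac{1}{2} \left[ \mathbf{x}_k^T\,\mathbf{M}\,\delta\mathbf{q}_{(0)} - \frac{\mathbf{x}_k^T\,\mathbf{M}\,\delta\dot{\mathbf{q}}_{(0)}}{\sqrt{\lambda_k}} \right]. \tag{11.38}$$

Note, finally, that since there are $2\mathcal{F}$ arbitrary constants (two for each of the $\mathcal{F}$ normal
modes), we can be sure that Equation (11.31) represents the most general solution to
Equation (11.13).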
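
The way in which the initial conditions fix the mode constants can be sketched numerically. The code below solves Equations (11.35) and (11.36) for the two constants of each mode, and then evaluates the superposition (11.31). Complex arithmetic is used so that negative eigenvalues (oscillatory modes) are handled by the same formula. The function names are hypothetical, zero eigenvalues are assumed not to occur, and the sample modes are those of two unit masses joined by three unit springs, chosen here purely for illustration.

```python
import cmath


def mode_constants(lam, x, M, dq0, dqdot0):
    """Solve Eqs. (11.35) and (11.36) for (alpha_k, beta_k); assumes lam != 0."""
    n = len(x)
    xM = [sum(x[i] * M[i][j] for i in range(n)) for j in range(n)]  # row vector x^T M
    a = sum(xM[j] * dq0[j] for j in range(n))      # alpha + beta
    b = sum(xM[j] * dqdot0[j] for j in range(n))   # sqrt(lam) (alpha - beta)
    root = cmath.sqrt(lam)
    return 0.5 * (a + b / root), 0.5 * (a - b / root)


def displacement(t, modes, M, dq0, dqdot0):
    """Evaluate Eq. (11.31) at time t; modes is a list of (lam, x) pairs."""
    n = len(dq0)
    total = [0j] * n
    for lam, x in modes:
        alpha, beta = mode_constants(lam, x, M, dq0, dqdot0)
        root = cmath.sqrt(lam)
        amp = alpha * cmath.exp(root * t) + beta * cmath.exp(-root * t)
        for i in range(n):
            total[i] += amp * x[i]
    return total


# Hypothetical example: unit mass matrix, eigenvalues -1 and -3.
M = [[1.0, 0.0], [0.0, 1.0]]
s = 2.0 ** -0.5
modes = [(-1.0, [s, s]), (-3.0, [s, -s])]
dq0, dqdot0 = [0.1, 0.0], [0.0, 0.2]

at_zero = displacement(0.0, modes, M, dq0, dqdot0)
h = 1.0e-5
plus = displacement(h, modes, M, dq0, dqdot0)
minus = displacement(-h, modes, M, dq0, dqdot0)
velocity = [(p - q) / (2.0 * h) for p, q in zip(plus, minus)]
```

The reconstructed displacement and its numerically estimated time derivative at $t = 0$ should reproduce the prescribed initial data, with negligible imaginary parts.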



11.6 Normal Coordinates 

Since the eigenvectors $\mathbf{x}_k$, for $k = 1, \mathcal{F}$, span $\mathcal{F}$-dimensional vector space, we can always
write the displacement vector $\delta\mathbf{q}$ as some linear combination of the $\mathbf{x}_k$: i.e.,

$$\delta\mathbf{q}(t) = \sum_{k=1,\mathcal{F}} \eta_k(t)\,\mathbf{x}_k. \tag{11.39}$$

We can regard the $\eta_k(t)$ as a new set of generalized coordinates, since specifying the $\eta_k$ is
equivalent to specifying the $\delta q_k$ (and, hence, the $q_k$). The $\eta_k$ are usually termed normal
coordinates. According to Equations (11.30) and (11.39), the normal coordinates can be
written in terms of the $\delta q_k$ as

$$\eta_k = \mathbf{x}_k^T\,\mathbf{M}\,\delta\mathbf{q}. \tag{11.40}$$

Let us now try to express $K$, $U$, and the equations of motion in terms of the $\eta_k$.
The kinetic energy can be written

$$K = \frac{1}{2}\,\delta\dot{\mathbf{q}}^T\,\mathbf{M}\,\delta\dot{\mathbf{q}}, \tag{11.41}$$

where use has been made of Equation (11.6). It follows from (11.39) that

$$K = \frac{1}{2} \sum_{k,l=1,\mathcal{F}} \dot{\eta}_k\,\dot{\eta}_l\,\mathbf{x}_k^T\,\mathbf{M}\,\mathbf{x}_l. \tag{11.42}$$
Finally, making use of the orthonormality condition (11.30), we obtain

$$K = \frac{1}{2} \sum_{k=1,\mathcal{F}} \dot{\eta}_k^2. \tag{11.43}$$

Hence, the kinetic energy $K$ takes the form of a diagonal quadratic form when expressed
in terms of the normal coordinates.
The potential energy can be written

$$U = -\frac{1}{2}\,\delta\mathbf{q}^T\,\mathbf{G}\,\delta\mathbf{q}, \tag{11.44}$$

where use has been made of Equation (11.11). It follows from (11.39) that

$$U = -\frac{1}{2} \sum_{k,l=1,\mathcal{F}} \eta_k\,\eta_l\,\mathbf{x}_k^T\,\mathbf{G}\,\mathbf{x}_l. \tag{11.45}$$

Finally, making use of Equation (11.19) and the orthonormality condition (11.30), we
obtain

$$U = -\frac{1}{2} \sum_{k=1,\mathcal{F}} \lambda_k\,\eta_k^2. \tag{11.46}$$

Hence, the potential energy $U$ also takes the form of a diagonal quadratic form when
expressed in terms of the normal coordinates.

Writing Lagrange's equations of motion in terms of the normal coordinates, we obtain
[cf., Equation (11.3)]

$$\frac{d}{dt}\left( \frac{\partial K}{\partial \dot{\eta}_k} \right) - \frac{\partial K}{\partial \eta_k} + \frac{\partial U}{\partial \eta_k} = 0, \tag{11.47}$$

for $k = 1, \mathcal{F}$. Thus, it follows from Equations (11.43) and (11.46) that

$$\ddot{\eta}_k = \lambda_k\,\eta_k, \tag{11.48}$$

for $k = 1, \mathcal{F}$. In other words, Lagrange's equations reduce to a set of $\mathcal{F}$ uncoupled simple
harmonic equations when expressed in terms of the normal coordinates. The solutions to
the above equations are obvious: i.e.,

$$\eta_k(t) = \alpha_k\,e^{+\sqrt{\lambda_k}\,t} + \beta_k\,e^{-\sqrt{\lambda_k}\,t}, \tag{11.49}$$

where $\alpha_k$ and $\beta_k$ are arbitrary constants. Hence, it is clear from Equations (11.39) and
(11.49) that the most general solution to the perturbed equations of motion is indeed
given by Equations (11.31) and (11.32).

In conclusion, the equations of motion of a many degree of freedom dynamical sys- 
tem which is slightly perturbed from an equilibrium state take a particularly simple form 
when expressed in terms of the normal coordinates. Each normal coordinate specifies the 
instantaneous displacement of an independent mode of oscillation (or secular growth) of 
the system. Moreover, each normal coordinate oscillates at a characteristic frequency (or 
grows at a characteristic rate), and is completely unaffected by the other coordinates. 
11.7 Spring-Coupled Masses



Consider the two degree of freedom dynamical system pictured in Figure 11.1. In this 
system, two point objects of mass m are free to move in one dimension. Furthermore, the 
masses are connected together by a spring of spring constant k, and are also each attached 
to fixed supports via springs of spring constant k'. 

Let $q_1$ and $q_2$ be the displacements of the first and second masses, respectively, from the
equilibrium state. It follows that the extensions of the left-hand, middle, and right-hand
springs are $q_1$, $q_2 - q_1$, and $-q_2$, respectively. The kinetic energy of the system takes the
form

$$K = \frac{m}{2}\,(\dot{q}_1^2 + \dot{q}_2^2), \tag{11.50}$$

whereas the potential energy is written

$$U = \frac{1}{2} \left[ k'\,q_1^2 + k\,(q_2 - q_1)^2 + k'\,q_2^2 \right]. \tag{11.51}$$

The above expression can be rearranged to give

$$U = \frac{1}{2} \left[ (k + k')\,q_1^2 - 2\,k\,q_1\,q_2 + (k + k')\,q_2^2 \right]. \tag{11.52}$$



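A comparison of Equations (11.50) and (11.52) with the standard forms (11.6) and
(11.11) yields the following expressions for the mass matrix, $\mathbf{M}$, and the force matrix, $\mathbf{G}$:

$$\mathbf{M} = \begin{pmatrix} m & 0 \\ 0 & m \end{pmatrix}, \tag{11.53}$$

$$\mathbf{G} = \begin{pmatrix} -k - k' & k \\ k & -k - k' \end{pmatrix}. \tag{11.54}$$

Now, the equation of motion of the system takes the form [see Equation (11.17)]

$$(\mathbf{G} - \lambda\,\mathbf{M})\,\mathbf{x} = \mathbf{0}, \tag{11.55}$$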

where $\mathbf{x}$ is the column vector of the $q_1$ and $q_2$ values. The solubility condition for the above
equation is

$$|\mathbf{G} - \lambda\,\mathbf{M}| = 0, \tag{11.56}$$

or

$$\begin{vmatrix} -k - k' - \lambda\,m & k \\ k & -k - k' - \lambda\,m \end{vmatrix} = 0, \tag{11.57}$$

which yields the following quadratic equation for the eigenvalue $\lambda$:

$$m^2\,\lambda^2 + 2\,m\,(k + k')\,\lambda + k'\,(k' + 2\,k) = 0. \tag{11.58}$$

The two roots of the above equation are

$$\lambda_1 = -\frac{k'}{m}, \tag{11.59}$$

$$\lambda_2 = -\frac{(2\,k + k')}{m}. \tag{11.60}$$

The fact that the roots are negative implies that both normal modes are oscillatory in
nature: i.e., the original equilibrium is stable. The characteristic oscillation frequencies of
the modes are

$$\omega_1 = \sqrt{-\lambda_1} = \sqrt{\frac{k'}{m}}, \tag{11.61}$$

$$\omega_2 = \sqrt{-\lambda_2} = \sqrt{\frac{2\,k + k'}{m}}. \tag{11.62}$$

Now, the first row of Equation (11.55) gives

$$\frac{q_1}{q_2} = \frac{k}{k + k' + \lambda\,m}. \tag{11.63}$$

Moreover, Equations (11.30) and (11.53) yield the following normalization condition for
the eigenvectors:

$$\mathbf{x}_k^T\,\mathbf{x}_k = m^{-1}, \tag{11.64}$$

for $k = 1, 2$. It follows that the two eigenvectors are

$$\mathbf{x}_1 = (2\,m)^{-1/2}\,(1, 1), \tag{11.65}$$

$$\mathbf{x}_2 = (2\,m)^{-1/2}\,(1, -1). \tag{11.66}$$

According to Equations (11.61)-(11.62) and (11.65)-(11.66), our two degree of freedom
system possesses two normal modes. The first mode oscillates at the frequency $\omega_1$,
and is a purely symmetric mode: i.e., $q_1 = q_2$. Note that such a mode does not stretch
the middle spring. Hence, $\omega_1$ is independent of $k$. In fact, $\omega_1$ is simply the characteristic
oscillation frequency of a mass $m$ on the end of a spring of spring constant $k'$. The second
mode oscillates at the frequency $\omega_2$, and is a purely anti-symmetric mode: i.e., $q_1 = -q_2$.
Since such a mode stretches the middle spring, the second mode experiences a greater
restoring force than the first, and hence has a higher oscillation frequency: i.e., $\omega_2 > \omega_1$.






Figure 11.2: A model triatomic molecule.

Note, finally, from Equations (11.40) and (11.53), that the normal coordinates of the
system are:

$$\eta_1 = \sqrt{\frac{m}{2}}\,(q_1 + q_2), \tag{11.67}$$

$$\eta_2 = \sqrt{\frac{m}{2}}\,(q_1 - q_2). \tag{11.68}$$

When expressed in terms of these normal coordinates, the kinetic and potential energies
of the system reduce to

$$K = \frac{1}{2}\,(\dot{\eta}_1^2 + \dot{\eta}_2^2), \tag{11.69}$$

$$U = \frac{1}{2}\,(\omega_1^2\,\eta_1^2 + \omega_2^2\,\eta_2^2), \tag{11.70}$$

respectively.
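
The decoupling of the normal coordinates can be checked by brute force. The sketch below integrates the coupled equations of motion that follow from the potential energy (11.52), then projects the result onto the normal coordinates (11.67) and (11.68). The velocity-Verlet integrator, the function names, and the sample values are assumptions made for illustration.

```python
import math


def accelerations(q1, q2, m, k, kp):
    """Accelerations from the potential (11.52); kp stands for k'."""
    a1 = (-kp * q1 + k * (q2 - q1)) / m
    a2 = (-kp * q2 - k * (q2 - q1)) / m
    return a1, a2


def simulate(q1, q2, v1, v2, m, k, kp, t_end, n_steps):
    """Velocity-Verlet integration of the two coupled masses."""
    dt = t_end / n_steps
    a1, a2 = accelerations(q1, q2, m, k, kp)
    for _ in range(n_steps):
        v1 += 0.5 * dt * a1
        v2 += 0.5 * dt * a2
        q1 += dt * v1
        q2 += dt * v2
        a1, a2 = accelerations(q1, q2, m, k, kp)
        v1 += 0.5 * dt * a1
        v2 += 0.5 * dt * a2
    return q1, q2, v1, v2


def normal_coordinates(q1, q2, m):
    """Normal coordinates, Eqs. (11.67) and (11.68)."""
    s = math.sqrt(m / 2.0)
    return s * (q1 + q2), s * (q1 - q2)


# Hypothetical sample values, with both masses released from rest.
m, k, kp = 1.0, 2.0, 1.0
omega1 = math.sqrt(kp / m)               # Eq. (11.61)
omega2 = math.sqrt((2.0 * k + kp) / m)   # Eq. (11.62)
q1_0, q2_0, t_end = 0.3, -0.1, 2.0
eta1_0, eta2_0 = normal_coordinates(q1_0, q2_0, m)
q1, q2, v1, v2 = simulate(q1_0, q2_0, 0.0, 0.0, m, k, kp, t_end, 20000)
eta1, eta2 = normal_coordinates(q1, q2, m)
```

Although the two displacements exchange energy in a complicated way, each normal coordinate should simply follow a cosine at its own frequency, as expected for a release from rest.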



11.8 Triatomic Molecule 

Consider the simple model of a linear triatomic molecule (e.g., carbon dioxide) illustrated 
in Figure 11.2. The molecule consists of a central atom of mass M flanked by two identical 
atoms of mass m. The atomic bonds are represented as springs of spring constant k. The 
linear displacements of the flanking atoms are $q_1$ and $q_2$, whilst that of the central atom is
$q_3$. Let us investigate the linear modes of oscillation of our model molecule.
The kinetic energy of the molecule is written

$$K = \frac{m}{2}\,(\dot{q}_1^2 + \dot{q}_2^2) + \frac{M}{2}\,\dot{q}_3^2, \tag{11.71}$$

whereas the potential energy takes the form

$$U = \frac{k}{2}\,(q_3 - q_1)^2 + \frac{k}{2}\,(q_2 - q_3)^2. \tag{11.72}$$

Clearly, we have a three degree of freedom dynamical system. However, we can reduce
this to a two degree of freedom system by only considering oscillatory modes of motion,
and, hence, neglecting translational modes. We can achieve this by demanding that the
center of mass of the system remains stationary. In other words, we require that

$$m\,(q_1 + q_2) + M\,q_3 = 0. \tag{11.73}$$

This constraint can be rearranged to give

$$q_3 = -\frac{m}{M}\,(q_1 + q_2). \tag{11.74}$$

Eliminating $q_3$ from Equations (11.71) and (11.72), we obtain

$$K = \frac{m}{2} \left[ (1 + \alpha)\,\dot{q}_1^2 + 2\,\alpha\,\dot{q}_1\,\dot{q}_2 + (1 + \alpha)\,\dot{q}_2^2 \right], \tag{11.75}$$

and

$$U = \frac{k}{2} \left[ (1 + 2\,\alpha + 2\,\alpha^2)\,q_1^2 + 4\,\alpha\,(1 + \alpha)\,q_1\,q_2 + (1 + 2\,\alpha + 2\,\alpha^2)\,q_2^2 \right], \tag{11.76}$$

respectively, where $\alpha = m/M$.

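A comparison of the above expressions with the standard forms (11.6) and (11.11)
yields the following expressions for the mass matrix, $\mathbf{M}$, and the force matrix, $\mathbf{G}$:

$$\mathbf{M} = m \begin{pmatrix} 1 + \alpha & \alpha \\ \alpha & 1 + \alpha \end{pmatrix}, \tag{11.77}$$

$$\mathbf{G} = -k \begin{pmatrix} 1 + 2\,\alpha + 2\,\alpha^2 & 2\,\alpha\,(1 + \alpha) \\ 2\,\alpha\,(1 + \alpha) & 1 + 2\,\alpha + 2\,\alpha^2 \end{pmatrix}. \tag{11.78}$$

Now, the equation of motion of the system takes the form [see Equation (11.17)]

$$(\mathbf{G} - \lambda\,\mathbf{M})\,\mathbf{x} = \mathbf{0}, \tag{11.79}$$

where $\mathbf{x}$ is the column vector of the $q_1$ and $q_2$ values. The solubility condition for the above
equation is

$$|\mathbf{G} - \lambda\,\mathbf{M}| = 0, \tag{11.80}$$

which yields the following quadratic equation for the eigenvalue $\lambda$:

$$(1 + 2\,\alpha) \left[ m^2\,\lambda^2 + 2\,m\,k\,(1 + \alpha)\,\lambda + k^2\,(1 + 2\,\alpha) \right] = 0. \tag{11.81}$$

The two roots of the above equation are

$$\lambda_1 = -\frac{k}{m}, \tag{11.82}$$

$$\lambda_2 = -\frac{k\,(1 + 2\,\alpha)}{m}. \tag{11.83}$$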
The fact that the roots are negative implies that both normal modes are indeed oscillatory
in nature. The characteristic oscillation frequencies are

$$\omega_1 = \sqrt{-\lambda_1} = \sqrt{\frac{k}{m}}, \tag{11.84}$$

$$\omega_2 = \sqrt{-\lambda_2} = \sqrt{\frac{k\,(1 + 2\,\alpha)}{m}}. \tag{11.85}$$

Equation (11.79) can now be solved, subject to the normalization condition (11.30), to
give the two eigenvectors:

$$\mathbf{x}_1 = (2\,m)^{-1/2}\,(1, -1), \tag{11.86}$$

$$\mathbf{x}_2 = (2\,m)^{-1/2}\,(1 + 2\,\alpha)^{-1/2}\,(1, 1). \tag{11.87}$$

Thus, we conclude from Equations (11.74) and (11.84)-(11.87) that our model molecule
possesses two normal modes of oscillation. The first mode oscillates at the frequency $\omega_1$,
and is an anti-symmetric mode in which $q_1 = -q_2$ and $q_3 = 0$. In other words, in this mode
of oscillation, the two end atoms move in opposite directions whilst the central atom
remains stationary. The second mode oscillates at the frequency $\omega_2$, and is a mixed symmetry
mode in which $q_1 = q_2$ but $q_3 = -2\,\alpha\,q_1$. In other words, in this mode of oscillation, the
two end atoms move in the same direction whilst the central atom moves in the opposite
direction.

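Finally, it is easily demonstrated that the normal coordinates of the system are

$$\eta_1 = \sqrt{\frac{m}{2}}\,(q_1 - q_2), \tag{11.88}$$

$$\eta_2 = \sqrt{\frac{m\,(1 + 2\,\alpha)}{2}}\,(q_1 + q_2). \tag{11.89}$$

When expressed in terms of these coordinates, $K$ and $U$ reduce to

$$K = \frac{1}{2}\,(\dot{\eta}_1^2 + \dot{\eta}_2^2), \tag{11.90}$$

$$U = \frac{1}{2}\,(\omega_1^2\,\eta_1^2 + \omega_2^2\,\eta_2^2), \tag{11.91}$$

respectively.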
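
The eigenvalues and eigenvectors quoted above can be verified by direct substitution into Equation (11.79). The sketch below builds the matrices (11.77) and (11.78) and checks the pairs (11.82)-(11.83) and (11.86)-(11.87). The function names are hypothetical, and the sample masses (16 and 12, in arbitrary units, roughly the oxygen-to-carbon ratio) and unit spring constant are assumptions chosen only to mimic the carbon dioxide example.

```python
import math


def triatomic_matrices(m, M_central, k):
    """Mass and force matrices, Eqs. (11.77) and (11.78)."""
    alpha = m / M_central
    mass = [[m * (1.0 + alpha), m * alpha],
            [m * alpha, m * (1.0 + alpha)]]
    diag = 1.0 + 2.0 * alpha + 2.0 * alpha ** 2
    off = 2.0 * alpha * (1.0 + alpha)
    force = [[-k * diag, -k * off],
             [-k * off, -k * diag]]
    return mass, force


def triatomic_modes(m, M_central, k):
    """Eigenpairs quoted in Eqs. (11.82), (11.83), (11.86), and (11.87)."""
    alpha = m / M_central
    s1 = (2.0 * m) ** -0.5
    s2 = (2.0 * m * (1.0 + 2.0 * alpha)) ** -0.5
    return [(-k / m, [s1, -s1]),
            (-k * (1.0 + 2.0 * alpha) / m, [s2, s2])]


def residual(force, mass, lam, x):
    """Components of (G - lam M) x, which should vanish for an eigenpair."""
    return [sum((force[i][j] - lam * mass[i][j]) * x[j] for j in range(2))
            for i in range(2)]


# Hypothetical sample values.
m, M_central, k = 16.0, 12.0, 1.0
mass, force = triatomic_matrices(m, M_central, k)
modes = triatomic_modes(m, M_central, k)
omega1, omega2 = (math.sqrt(-lam) for lam, _ in modes)
frequency_ratio = omega2 / omega1
```

The ratio of the two frequencies depends only on the mass ratio, being $\sqrt{1 + 2\,\alpha}$ according to Equations (11.84) and (11.85).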



11.9 Exercises 

11.1. A particle of mass m is attached to a rigid support by means of a spring of spring constant k. 
At equilibrium, the spring hangs vertically downward. An identical oscillator is added to this 
system, the spring of the former being attached to the mass of the latter. Calculate the char- 
acteristic frequencies for one-dimensional vertical oscillations, and describe the associated 
normal modes. 
11.2. A simple pendulum consists of a bob of mass $m$ suspended by an inextensible light string of
length $l$. From the bob of this pendulum, a second identical pendulum is suspended. Consider
the case of small angle oscillations in the same vertical plane. Calculate the characteristic 
frequencies, and describe the associated normal modes. 

11.3. A thin hoop of radius a and mass m oscillates in its own plane (which is constrained to be 
vertical) hanging from a single fixed point. A small mass m slides without friction along the 
hoop. Consider the case of small oscillations. Calculate the characteristic frequencies, and 
describe the associated normal modes. 

11.4. A simple pendulum of mass m and length l is suspended from a block of mass M which is
constrained to slide along a frictionless horizontal track. Consider the case of small oscilla- 
tions. Calculate the characteristic frequencies, and describe the associated normal modes. 

11.5. A thin uniform rod of mass m and length a is suspended from one end by a light string of 
length l. The other end of the string is attached to a fixed support. Consider the case of
small oscillations in a vertical plane. Calculate the characteristic frequencies, and describe 
the associated normal modes. 

11.6. A triatomic molecule consists of three atoms of equal mass. Each atom is attached to the other
two atoms via identical chemical bonds. The equilibrium state of the molecule is such that the 
masses are at the vertices of an equilateral triangle of side a. Modeling the chemical bonds 
as springs of spring constant k, and only considering motion in the plane of the molecule, 
find the vibrational frequencies and normal modes of the molecule. Exclude translational 
and rotational modes. 
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12 Gravitational Potential Theory 



12.1 Introduction 

This chapter employs gravitational potential theory in combination with Newtonian dy- 
namics to examine various interesting phenomena in the Solar System. 

12.2 Gravitational Potential 

Consider two point masses, $m$ and $m'$, located at position vectors $\mathbf{r}$ and $\mathbf{r}'$, respectively.
According to Section 5.3, the acceleration $\mathbf{g}$ of mass $m$ due to the gravitational force
exerted on it by mass $m'$ takes the form

$$\mathbf{g} = G\,m'\,\frac{(\mathbf{r}' - \mathbf{r})}{|\mathbf{r}' - \mathbf{r}|^{3}}. \tag{12.1}$$

Now, the x-component of this acceleration is written

$$g_x = G\,m'\,\frac{(x'-x)}{[(x'-x)^2 + (y'-y)^2 + (z'-z)^2]^{3/2}}, \tag{12.2}$$

where $\mathbf{r} = (x, y, z)$ and $\mathbf{r}' = (x', y', z')$. However, as is easily demonstrated,

$$\frac{(x'-x)}{[(x'-x)^2 + (y'-y)^2 + (z'-z)^2]^{3/2}} = \frac{\partial}{\partial x}\left(\frac{1}{[(x'-x)^2 + (y'-y)^2 + (z'-z)^2]^{1/2}}\right). \tag{12.3}$$

Hence,

$$g_x = G\,m'\,\frac{\partial}{\partial x}\left(\frac{1}{|\mathbf{r}' - \mathbf{r}|}\right), \tag{12.4}$$

with analogous expressions for $g_y$ and $g_z$. It follows that

$$\mathbf{g} = -\nabla\Phi, \tag{12.5}$$

where

$$\Phi = -\frac{G\,m'}{|\mathbf{r}' - \mathbf{r}|} \tag{12.6}$$

is termed the gravitational potential. Of course, we can only write $\mathbf{g}$ in the form (12.5)
because gravity is a conservative force (see Chapter 2). Note that gravitational potential,
$\Phi$, is directly related to gravitational potential energy, $U$. In fact, the potential energy of
mass $m$ is $U = m\,\Phi$.






Now, it is well-known that gravity is a superposable force. In other words, the gravita- 
tional force exerted on some test mass by a collection of point masses is simply the sum of 
the forces exerted on the test mass by each point mass taken in isolation. It follows that 
the gravitational potential generated by a collection of point masses at a certain location 
in space is the sum of the potentials generated at that location by each point mass taken 
in isolation. Hence, using Equation (12.6), if there are N point masses, $m_i$ (for $i = 1, N$),
located at position vectors $\mathbf{r}_i$, then the gravitational potential generated at position vector
$\mathbf{r}$ is simply

$$\Phi(\mathbf{r}) = -G \sum_{i=1,N} \frac{m_i}{|\mathbf{r}_i - \mathbf{r}|}. \tag{12.7}$$

Suppose, finally, that, instead of having a collection of point masses, we have a
continuous mass distribution. In other words, let the mass at position vector $\mathbf{r}'$ be $\rho(\mathbf{r}')\,d^3\mathbf{r}'$,
where $\rho(\mathbf{r}')$ is the local mass density, and $d^3\mathbf{r}'$ a volume element. Summing over all space,
and taking the limit $d^3\mathbf{r}' \rightarrow 0$, Equation (12.7) yields

$$\Phi(\mathbf{r}) = -G \int \frac{\rho(\mathbf{r}')}{|\mathbf{r}' - \mathbf{r}|}\,d^3\mathbf{r}', \tag{12.8}$$

where the integral is taken over all space. This is the general expression for the
gravitational potential, $\Phi(\mathbf{r})$, generated by a continuous mass distribution, $\rho(\mathbf{r})$.
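The superposition rule (12.7), and the relation g = -grad(Phi) of Equation (12.5), lend themselves to a quick numerical check. The sketch below is illustrative only: the value of G is the standard one (it is not quoted in this section), the masses and positions are arbitrary made-up test data, and the helper names are hypothetical.

```python
import numpy as np

G = 6.674e-11  # m^3 kg^-1 s^-2 (assumed standard value, not quoted in the text)

def potential(r, masses, positions):
    """Equation (12.7): Phi(r) = -G sum_i m_i / |r_i - r|."""
    r = np.asarray(r, dtype=float)
    dist = np.linalg.norm(np.asarray(positions, dtype=float) - r, axis=1)
    return -G * np.sum(np.asarray(masses, dtype=float) / dist)

def acceleration(r, masses, positions):
    """Sum of Equation (12.1) over all point masses."""
    r = np.asarray(r, dtype=float)
    sep = np.asarray(positions, dtype=float) - r          # r_i - r
    dist = np.linalg.norm(sep, axis=1)
    m = np.asarray(masses, dtype=float)
    return G * np.sum(m[:, None] * sep / dist[:, None] ** 3, axis=0)

def numerical_gradient(f, r, h=1.0e-3):
    """Central-difference gradient of a scalar function f at point r."""
    r = np.asarray(r, dtype=float)
    grad = np.zeros(3)
    for i in range(3):
        step = np.zeros(3)
        step[i] = h
        grad[i] = (f(r + step) - f(r - step)) / (2.0 * h)
    return grad

# Arbitrary test configuration (hypothetical data).
masses = [5.0e10, 2.0e10]
positions = [[10.0, 0.0, 0.0], [0.0, -5.0, 3.0]]
point = [1.0, 2.0, 3.0]

g_direct = acceleration(point, masses, positions)
g_from_phi = -numerical_gradient(lambda x: potential(x, masses, positions), point)
print(g_direct, g_from_phi)   # the two vectors should agree closely
```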



12.3 Axially Symmetric Mass Distributions 



At this point, it is convenient to adopt standard spherical coordinates, $(r, \theta, \phi)$, aligned
along the z-axis. These coordinates are related to regular Cartesian coordinates as follows
(see Section A.17):

$$x = r\,\sin\theta\,\cos\phi, \tag{12.9}$$

$$y = r\,\sin\theta\,\sin\phi, \tag{12.10}$$

$$z = r\,\cos\theta. \tag{12.11}$$

Consider an axially symmetric mass distribution: i.e., a $\rho(\mathbf{r})$ which is independent of the
azimuthal angle, $\phi$. We would expect such a mass distribution to generate an axially
symmetric gravitational potential, $\Phi(r, \theta)$. Hence, without loss of generality, we can set $\phi = 0$
when evaluating $\Phi$ from Equation (12.8). In fact, given that $d^3\mathbf{r}' = r'^2 \sin\theta'\,dr'\,d\theta'\,d\phi'$
in spherical coordinates, this equation yields

$$\Phi(r,\theta) = -G \int_0^\infty\!\int_0^\pi\!\int_0^{2\pi} \frac{r'^2\,\rho(r',\theta')\,\sin\theta'}{|\mathbf{r} - \mathbf{r}'|}\,dr'\,d\theta'\,d\phi', \tag{12.12}$$

with the right-hand side evaluated at $\phi = 0$. However, since $\rho(r', \theta')$ is independent of $\phi'$,
the above equation can also be written

$$\Phi(r,\theta) = -2\pi\,G \int_0^\infty\!\int_0^\pi r'^2\,\rho(r',\theta')\,\sin\theta'\,\langle|\mathbf{r} - \mathbf{r}'|^{-1}\rangle\,dr'\,d\theta', \tag{12.13}$$

where $\langle\cdots\rangle$ denotes an average over the azimuthal angle, $\phi'$.
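Now,

$$|\mathbf{r}' - \mathbf{r}|^{-1} = (r^2 - 2\,\mathbf{r}\cdot\mathbf{r}' + r'^2)^{-1/2}, \tag{12.14}$$

and

$$\mathbf{r}\cdot\mathbf{r}' = r\,r'\,F, \tag{12.15}$$

where (at $\phi = 0$)

$$F = \sin\theta\,\sin\theta'\,\cos\phi' + \cos\theta\,\cos\theta'. \tag{12.16}$$

Hence,

$$|\mathbf{r}' - \mathbf{r}|^{-1} = (r^2 - 2\,r\,r'\,F + r'^2)^{-1/2}. \tag{12.17}$$

Suppose that $r > r'$. In this case, we can expand $|\mathbf{r}' - \mathbf{r}|^{-1}$ as a convergent power series
in $r'/r$, to give

$$|\mathbf{r}' - \mathbf{r}|^{-1} = \frac{1}{r}\left[1 + \left(\frac{r'}{r}\right) F + \frac{1}{2}\left(\frac{r'}{r}\right)^2 (3F^2 - 1) + \mathcal{O}\!\left(\frac{r'}{r}\right)^3\right]. \tag{12.18}$$

Let us now average this expression over the azimuthal angle, $\phi'$. Since $\langle 1\rangle = 1$, $\langle\cos\phi'\rangle = 0$,
and $\langle\cos^2\phi'\rangle = 1/2$, it is easily seen that

$$\langle F\rangle = \cos\theta\,\cos\theta', \tag{12.19}$$

$$\langle F^2\rangle = \frac{1}{2}\sin^2\theta\,\sin^2\theta' + \cos^2\theta\,\cos^2\theta'
= \frac{1}{3} + \frac{2}{3}\left(\frac{3}{2}\cos^2\theta - \frac{1}{2}\right)\left(\frac{3}{2}\cos^2\theta' - \frac{1}{2}\right). \tag{12.20}$$

Hence,

$$\langle|\mathbf{r}' - \mathbf{r}|^{-1}\rangle = \frac{1}{r}\left[1 + \left(\frac{r'}{r}\right)\cos\theta\,\cos\theta'
+ \left(\frac{r'}{r}\right)^2\left(\frac{3}{2}\cos^2\theta - \frac{1}{2}\right)\left(\frac{3}{2}\cos^2\theta' - \frac{1}{2}\right) + \mathcal{O}\!\left(\frac{r'}{r}\right)^3\right]. \tag{12.21}$$

Now, the well-known Legendre polynomials, $P_n(x)$, are defined

$$P_n(x) = \frac{1}{2^n\,n!}\,\frac{d^n}{dx^n}\left[(x^2 - 1)^n\right], \tag{12.22}$$

for $n = 0, \infty$. It follows that

$$P_0(x) = 1, \tag{12.23}$$

$$P_1(x) = x, \tag{12.24}$$

$$P_2(x) = \frac{1}{2}\,(3x^2 - 1), \tag{12.25}$$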
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etc. The Legendre polynomials are mutually orthogonal: i.e., 

$$\int_{-1}^{1} P_n(x)\,P_m(x)\,dx = \int_0^\pi P_n(\cos\theta)\,P_m(\cos\theta)\,\sin\theta\,d\theta = \frac{\delta_{nm}}{n + 1/2}. \tag{12.26}$$

Here, $\delta_{nm}$ is 1 if $n = m$, and 0 otherwise. The Legendre polynomials also form a complete
set: i.e., any well-behaved function of $x$ can be represented as a weighted sum of the $P_n(x)$.
Likewise, any well-behaved (even) function of $\theta$ can be represented as a weighted sum of
the $P_n(\cos\theta)$.
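The orthogonality relation (12.26) can be confirmed numerically. The following sketch uses NumPy's Legendre utilities together with Gauss-Legendre quadrature, which is exact for the polynomial integrands involved here. The function name `legendre_overlap` and the quadrature order are illustrative choices, not anything prescribed by the text.

```python
import numpy as np
from numpy.polynomial.legendre import Legendre, leggauss

def legendre_overlap(n, m, order=32):
    """Integral of P_n(x) P_m(x) over [-1, 1] by Gauss-Legendre quadrature."""
    x, w = leggauss(order)                 # nodes and weights on [-1, 1]
    return np.sum(w * Legendre.basis(n)(x) * Legendre.basis(m)(x))

for n in range(4):
    for m in range(4):
        expected = 1.0 / (n + 0.5) if n == m else 0.0
        print(n, m, round(legendre_overlap(n, m), 12), expected)
```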

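A comparison of Equation (12.21) and Equations (12.23)-(12.25) makes it reasonably
clear that, when $r > r'$, the complete expansion of $\langle|\mathbf{r}' - \mathbf{r}|^{-1}\rangle$ is

$$\langle|\mathbf{r}' - \mathbf{r}|^{-1}\rangle = \frac{1}{r}\sum_{n=0,\infty}\left(\frac{r'}{r}\right)^n P_n(\cos\theta)\,P_n(\cos\theta'). \tag{12.27}$$

Similarly, when $r < r'$, we can expand in powers of $r/r'$ to obtain

$$\langle|\mathbf{r}' - \mathbf{r}|^{-1}\rangle = \frac{1}{r'}\sum_{n=0,\infty}\left(\frac{r}{r'}\right)^n P_n(\cos\theta)\,P_n(\cos\theta'). \tag{12.28}$$

It follows from Equations (12.13), (12.27), and (12.28) that

$$\Phi(r,\theta) = \sum_{n=0,\infty}\Phi_n(r)\,P_n(\cos\theta), \tag{12.29}$$

where

$$\Phi_n(r) = -\frac{2\pi\,G}{r^{n+1}}\int_0^r\!\int_0^\pi r'^{\,n+2}\,\rho(r',\theta')\,P_n(\cos\theta')\,\sin\theta'\,dr'\,d\theta'
- 2\pi\,G\,r^n\int_r^\infty\!\int_0^\pi r'^{\,1-n}\,\rho(r',\theta')\,P_n(\cos\theta')\,\sin\theta'\,dr'\,d\theta'. \tag{12.30}$$

Now, given that the $P_n(\cos\theta)$ form a complete set, we can always write

$$\rho(r,\theta) = \sum_{n=0,\infty}\rho_n(r)\,P_n(\cos\theta). \tag{12.31}$$

This expression can be inverted, with the aid of Equation (12.26), to give

$$\rho_n(r) = (n + 1/2)\int_0^\pi \rho(r,\theta)\,P_n(\cos\theta)\,\sin\theta\,d\theta. \tag{12.32}$$

Hence, Equation (12.30) reduces to

$$\Phi_n(r) = -\frac{2\pi\,G}{(n+1/2)\,r^{n+1}}\int_0^r r'^{\,n+2}\,\rho_n(r')\,dr'
- \frac{2\pi\,G\,r^n}{n+1/2}\int_r^\infty r'^{\,1-n}\,\rho_n(r')\,dr'. \tag{12.33}$$

Thus, we now have a general expression for the gravitational potential, $\Phi(r, \theta)$, generated
by any axially symmetric mass distribution, $\rho(r, \theta)$.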
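The step from the low-order result (12.21) to the full series (12.27) and (12.28) is asserted rather than proved above, so a numerical comparison may be reassuring. The sketch below averages the inverse separation of Equation (12.17) over the azimuthal angle directly, and compares it with a truncated Legendre sum. The sample radii and angles are arbitrary, and the function names and truncation order are hypothetical choices.

```python
import numpy as np
from numpy.polynomial.legendre import Legendre

def azimuthal_average(r, theta, rp, thetap, nphi=2000):
    """Average of |r' - r|^{-1} over phi' (taking phi = 0), from Eqs. (12.16)-(12.17)."""
    phip = 2.0 * np.pi * (np.arange(nphi) + 0.5) / nphi
    F = np.sin(theta) * np.sin(thetap) * np.cos(phip) + np.cos(theta) * np.cos(thetap)
    return np.mean((r * r - 2.0 * r * rp * F + rp * rp) ** -0.5)

def legendre_series(r, theta, rp, thetap, nmax=60):
    """Truncated sum (12.27) when r > r', or (12.28) when r < r'."""
    big, small = max(r, rp), min(r, rp)
    total = 0.0
    for n in range(nmax + 1):
        Pn = Legendre.basis(n)
        total += (small / big) ** n * Pn(np.cos(theta)) * Pn(np.cos(thetap))
    return total / big

# Arbitrary sample points, one with r > r' and one with r < r'.
print(azimuthal_average(2.0, 0.7, 1.0, 1.9), legendre_series(2.0, 0.7, 1.0, 1.9))
print(azimuthal_average(1.0, 0.7, 3.0, 1.9), legendre_series(1.0, 0.7, 3.0, 1.9))
```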
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12.4 Potential Due to a Uniform Sphere 



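Let us calculate the gravitational potential generated by a sphere of uniform mass density
$\gamma$ and radius $a$. Expressing $\rho(r, \theta)$ in the form (12.31), it is clear that

$$\rho_0(r) = \begin{cases} \gamma & \text{for } r \leq a \\ 0 & \text{for } r > a \end{cases}, \tag{12.34}$$

with $\rho_n(r) = 0$ for $n > 0$. Thus, from (12.33),

$$\Phi_0(r) = -\frac{4\pi\,G\,\gamma}{r}\int_0^r r'^2\,dr' - 4\pi\,G\,\gamma\int_r^a r'\,dr' \tag{12.35}$$

for $r \leq a$, and

$$\Phi_0(r) = -\frac{4\pi\,G\,\gamma}{r}\int_0^a r'^2\,dr' \tag{12.36}$$

for $r > a$, with $\Phi_n(r) = 0$ for $n > 0$. Hence,

$$\Phi(r) = -\frac{2\pi\,G\,\gamma}{3}\,(3a^2 - r^2) = -G\,M\,\frac{(3a^2 - r^2)}{2\,a^3} \tag{12.37}$$

for $r \leq a$, and

$$\Phi(r) = -\frac{4\pi\,G\,\gamma}{3}\,\frac{a^3}{r} = -\frac{G\,M}{r} \tag{12.38}$$

for $r > a$. Here, $M = (4\pi/3)\,a^3\,\gamma$ is the total mass of the sphere.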

According to Equation (12.38), the gravitational potential outside a uniform sphere of
mass $M$ is the same as that generated by a point mass $M$ located at the sphere's center. It
turns out that this is a general result for any finite spherically symmetric mass distribution.
Indeed, from the previous analysis, it is clear that $\rho(r) = \rho_0(r)$ and $\Phi(r) = \Phi_0(r)$ for a
spherically symmetric mass distribution. Suppose that the mass distribution extends out
to $r = a$. It immediately follows, from Equation (12.33), that

$$\Phi(r) = -\frac{G}{r}\int_0^a 4\pi\,r'^2\,\rho(r')\,dr' = -\frac{G\,M}{r} \tag{12.39}$$

for r > a, where M is the total mass of the distribution. We, thus, conclude that Newton's 
laws of motion, in their primitive form, apply not only to point masses, but also to extended 
spherically symmetric masses. In fact, this is something which we have implicitly assumed 
all along in this book. 

[Figure 12.1: Prolate and oblate spheroids, each shown with its axis of rotation.]

According to Equation (12.37), the gravitational potential inside a uniform sphere is
quadratic in $r$. This implies that if a narrow shaft were drilled through the center of the
sphere then a test mass, $m$, moving in this shaft would experience a gravitational force
acting toward the center which scales linearly in $r$. In fact, the force in question is given
by $f_r = -m\,\partial\Phi/\partial r = -(G\,m\,M/a^3)\,r$. It follows that a test mass dropped into the shaft
executes simple harmonic motion about the center of the sphere with period

$$T = 2\pi\sqrt{\frac{a}{g}}, \tag{12.40}$$

where $g = G\,M/a^2$ is the gravitational acceleration at the sphere's surface.
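As a worked illustration of Equations (12.37), (12.38), and the shaft period, the sketch below evaluates them for an Earth-like uniform sphere. The mass and radius are the values quoted for the Earth in Section 12.6, the value of G is the standard one (an assumption, since it is not given here), and the function names are hypothetical. Treating the Earth as uniform is, of course, only a rough model.

```python
import math

G = 6.674e-11  # m^3 kg^-1 s^-2 (assumed standard value)

def sphere_potential(r, M, a):
    """Equation (12.37) inside the sphere, Equation (12.38) outside."""
    if r <= a:
        return -G * M * (3.0 * a * a - r * r) / (2.0 * a ** 3)
    return -G * M / r

def shaft_period(M, a):
    """Period of simple harmonic motion in a shaft through the center."""
    g = G * M / a ** 2          # surface gravitational acceleration
    return 2.0 * math.pi * math.sqrt(a / g)

M_earth, a_earth = 5.97e24, 6.37e6      # kg, m (values quoted in Section 12.6)
T = shaft_period(M_earth, a_earth)
print(T / 60.0, "minutes")              # roughly 84 minutes
# The two branches of the potential agree at the surface r = a.
print(sphere_potential(a_earth, M_earth, a_earth), -G * M_earth / a_earth)
```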
12.5 Potential Outside a Uniform Spheroid 

Let us now calculate the gravitational potential generated outside a spheroid of uniform
mass density $\gamma$ and mean radius $a$. A spheroid is the solid body produced by rotating an
ellipse about a major or minor axis. Let the axis of rotation coincide with the z-axis, and
let the outer boundary of the spheroid satisfy

$$r = a_\epsilon(\theta) = a\left[1 - \frac{2}{3}\,\epsilon\,P_2(\cos\theta)\right], \tag{12.41}$$

where $\epsilon$ is termed the ellipticity. Here, we are assuming that $|\epsilon| \ll 1$, so that the
spheroid is very close to being a sphere. If $\epsilon > 0$ then the spheroid is slightly squashed
along its symmetry axis, and is termed oblate. Likewise, if $\epsilon < 0$ then the spheroid is
slightly elongated along its axis, and is termed prolate (see Figure 12.1). Of course, if
$\epsilon = 0$ then the spheroid reduces to a sphere.

Now, according to Equations (12.29) and (12.30), the gravitational potential generated
outside an axially symmetric mass distribution can be written

$$\Phi(r,\theta) = \sum_{n=0,\infty} J_n\,\frac{P_n(\cos\theta)}{r^{n+1}}, \tag{12.42}$$

where

$$J_n = -2\pi\,G\int\!\!\int r^{2+n}\,\rho(r,\theta)\,P_n(\cos\theta)\,\sin\theta\,dr\,d\theta. \tag{12.43}$$



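Here, the integral is taken over the whole cross-section of the distribution in $r$-$\theta$ space.
It follows that for a uniform spheroid

$$J_n = -2\pi\,G\,\gamma\int_0^\pi P_n(\cos\theta)\int_0^{a_\epsilon(\theta)} r^{2+n}\,dr\,\sin\theta\,d\theta. \tag{12.44}$$

Hence,

$$J_n = -\frac{2\pi\,G\,\gamma}{(3+n)}\int_0^\pi P_n(\cos\theta)\,a_\epsilon^{\,3+n}(\theta)\,\sin\theta\,d\theta, \tag{12.45}$$

giving

$$J_n \simeq -\frac{2\pi\,G\,\gamma\,a^{3+n}}{(3+n)}\int_0^\pi P_n(\cos\theta)\left[P_0(\cos\theta) - \frac{2}{3}\,(3+n)\,\epsilon\,P_2(\cos\theta)\right]\sin\theta\,d\theta, \tag{12.46}$$

to first-order in $\epsilon$. It is thus clear, from Equation (12.26), that, to first-order in $\epsilon$, the only
non-zero $J_n$ are

$$J_0 = -\frac{4\pi\,G\,\gamma\,a^3}{3} = -G\,M, \tag{12.47}$$

$$J_2 = \frac{8\pi\,G\,\gamma\,a^5\,\epsilon}{15} = \frac{2}{5}\,G\,M\,a^2\,\epsilon, \tag{12.48}$$

since $M = (4\pi/3)\,a^3\,\gamma$.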

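Thus, the gravitational potential outside a uniform spheroid of total mass $M$, mean
radius $a$, and ellipticity $\epsilon$, is

$$\Phi(r,\theta) = -\frac{G\,M}{r} + \frac{2}{5}\,\frac{G\,M\,a^2}{r^3}\,\epsilon\,P_2(\cos\theta) + \mathcal{O}(\epsilon^2). \tag{12.49}$$

In particular, the gravitational potential on the surface of the spheroid is

$$\Phi(a_\epsilon,\theta) = -\frac{G\,M}{a_\epsilon} + \frac{2}{5}\,\frac{G\,M\,a^2}{a_\epsilon^{\,3}}\,\epsilon\,P_2(\cos\theta) + \mathcal{O}(\epsilon^2), \tag{12.50}$$

which yields

$$\Phi(a_\epsilon,\theta) \simeq -\frac{G\,M}{a}\left[1 + \frac{4}{15}\,\epsilon\,P_2(\cos\theta) + \mathcal{O}(\epsilon^2)\right], \tag{12.51}$$

where use has been made of Equation (12.41).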

Consider a self-gravitating spheroid of mass $M$, mean radius $a$, and ellipticity $\epsilon$: e.g.,
a star, or a planet. Assuming, for the sake of simplicity, that the spheroid is composed of
uniform density incompressible fluid, the gravitational potential on its surface is given by
Equation (12.51). However, the condition for an equilibrium state is that the potential be
constant over the surface. If this is not the case then there will be gravitational forces
acting tangential to the surface. Such forces cannot be balanced by internal pressure, which
only acts normal to the surface. Hence, from (12.51), it is clear that the condition for
equilibrium is $\epsilon = 0$. In other words, the equilibrium configuration of a self-gravitating mass
is a sphere. Deviations from this configuration can only be caused by forces in addition to
self-gravity and internal pressure: e.g., centrifugal forces due to rotation, or tidal forces
due to orbiting masses.



12.6 Rotational Flattening 

Let us consider the equilibrium configuration of a self-gravitating spheroid, composed of
uniform density incompressible fluid, which is rotating steadily about some fixed axis. Let
$M$ be the total mass, $a$ the mean radius, $\epsilon$ the ellipticity, and $\Omega$ the angular velocity of
rotation. Furthermore, let the axis of rotation coincide with the axis of symmetry, which is
assumed to run along the z-axis.

Let us transform to a non-inertial frame of reference which co-rotates with the spheroid
about the z-axis, and in which the spheroid consequently appears to be stationary. From
Chapter 7, the problem is now analogous to that of a non-rotating spheroid, except that
the surface acceleration is written $\mathbf{g} = \mathbf{g}_g + \mathbf{g}_c$, where $\mathbf{g}_g = -\nabla\Phi(r, \theta)$ is the gravitational
acceleration, and $\mathbf{g}_c$ the centrifugal acceleration. The latter acceleration is of magnitude
$r\,\sin\theta\,\Omega^2$, and is everywhere directed away from the axis of rotation (see Figure 12.2 and
Chapter 7). The acceleration thus has components

$$\mathbf{g}_c = r\,\Omega^2\,\sin\theta\,(\sin\theta,\ \cos\theta,\ 0) \tag{12.52}$$

in spherical polar coordinates. It follows that $\mathbf{g}_c = -\nabla\chi$, where

$$\chi(r,\theta) = -\frac{\Omega^2\,r^2}{2}\,\sin^2\theta = -\frac{\Omega^2\,r^2}{3}\,[1 - P_2(\cos\theta)] \tag{12.53}$$

can be thought of as a sort of centrifugal potential. Hence, the total surface acceleration is

$$\mathbf{g} = -\nabla(\Phi + \chi). \tag{12.54}$$

As before, the criterion for an equilibrium state is that the surface lie at a constant total
potential, so as to eliminate tangential surface forces which cannot be balanced by internal
pressure. Hence, assuming that the surface satisfies Equation (12.41), the equilibrium
configuration is specified by

$$\Phi(a_\epsilon,\theta) + \chi(a_\epsilon,\theta) = c, \tag{12.55}$$

where $c$ is a constant. It follows from Equations (12.51) and (12.53) that, to first-order in
$\epsilon$ and $\Omega^2$,

$$-\frac{G\,M}{a}\left[1 + \frac{4}{15}\,\epsilon\,P_2(\cos\theta)\right] - \frac{\Omega^2\,a^2}{3}\,[1 - P_2(\cos\theta)] \simeq c, \tag{12.56}$$
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Figure 12.2: Rotational flattening. 



which yields

$$\epsilon = \frac{5}{4}\,\frac{\Omega^2\,a^3}{G\,M}. \tag{12.57}$$

We conclude, from the above expression, that the equilibrium configuration of a (relatively
slowly) rotating self-gravitating mass distribution is an oblate spheroid: i.e., a sphere
which is slightly flattened along its axis of rotation. The degree of flattening is proportional
to the square of the rotation rate. Now, from (12.41), the mean radius of the spheroid is
$a$, the radius at the poles (i.e., along the axis of rotation) is $a_p = a\,(1 - 2\epsilon/3)$, and the
radius at the equator (i.e., perpendicular to the axis of rotation) is $a_e = a\,(1 + \epsilon/3)$ (see
Figure 12.2). Hence, the degree of rotational flattening can be written

$$\frac{a_e - a_p}{a} = \epsilon = \frac{5}{4}\,\frac{\Omega^2\,a^3}{G\,M}. \tag{12.58}$$

Now, for the Earth, $a = 6.37\times 10^{6}$ m, $\Omega = 7.27\times 10^{-5}$ rad/s, and $M = 5.97\times 10^{24}$ kg.
Thus, we predict that

$$\epsilon = 0.00429, \tag{12.59}$$

corresponding to a difference between equatorial and polar radii of

$$\Delta a = a_e - a_p = \epsilon\,a = 27.3\ {\rm km}. \tag{12.60}$$

In fact, the observed degree of flattening of the Earth is $\epsilon = 0.00335$, corresponding to
a difference between equatorial and polar radii of 21.4 km. The main reason that our
analysis has overestimated the degree of rotational flattening of the Earth is that it models
the terrestrial interior as a uniform density incompressible fluid. In reality, the Earth's core
is much denser than its crust (see Exercise 12.1).

For Jupiter, $a = 6.92\times 10^{7}$ m, $\Omega = 1.76\times 10^{-4}$ rad/s, and $M = 1.90\times 10^{27}$ kg. Hence,
we predict that

$$\epsilon = 0.101. \tag{12.61}$$
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Note that this degree of flattening is much larger than that of the Earth, due to Jupiter's 
relatively large radius (about 10 times that of Earth), combined with its relatively short 
rotation period (about 0.4 days) . In fact, the polar flattening of Jupiter is clearly apparent 
from images of this planet. The observed degree of polar flattening of Jupiter is actually
$\epsilon = 0.065$. Our estimate of $\epsilon$ is probably slightly too large because Jupiter, which is mostly
gaseous, has a mass distribution which is strongly concentrated at its core (see Exercise
12.1).
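The arithmetic behind Equations (12.59)-(12.61) is easy to reproduce. The sketch below evaluates Equation (12.57) with the planetary data quoted in this section. The value of G is the standard one (an assumption, since it is not quoted here), and the function name `rotational_ellipticity` is hypothetical.

```python
G = 6.674e-11  # m^3 kg^-1 s^-2 (assumed standard value)

def rotational_ellipticity(a, omega, M):
    """Equation (12.57): ellipticity of a slowly rotating uniform fluid body."""
    return 1.25 * omega ** 2 * a ** 3 / (G * M)

# Data quoted in the text: mean radius (m), rotation rate (rad/s), mass (kg).
earth = dict(a=6.37e6, omega=7.27e-5, M=5.97e24)
jupiter = dict(a=6.92e7, omega=1.76e-4, M=1.90e27)

eps_earth = rotational_ellipticity(**earth)
eps_jupiter = rotational_ellipticity(**jupiter)

print(eps_earth, eps_earth * earth["a"] / 1.0e3, "km")   # compare with (12.59), (12.60)
print(eps_jupiter)                                       # compare with (12.61)
```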



12.7 McCullough's Formula 

According to Equations (12.43) and (12.48), if the Earth is modeled as a spheroid of uniform
density $\gamma$ then its ellipticity is given by

$$\epsilon = -\int r^2\,\gamma\,P_2(\cos\theta)\,d^3\mathbf{r}\,\Big/\,I_0 = -\frac{1}{2}\int r^2\,\gamma\,(3\cos^2\theta - 1)\,d^3\mathbf{r}\,\Big/\,I_0, \tag{12.62}$$

where the integral is over the whole volume of the Earth, and $I_0 = (2/5)\,M\,a^2$ would be
the Earth's moment of inertia were it exactly spherical. Now, the Earth's moment of inertia
about its axis of rotation is given by

$$I_\parallel = \int (x^2 + y^2)\,\gamma\,d^3\mathbf{r} = \int r^2\,\gamma\,(1 - \cos^2\theta)\,d^3\mathbf{r}. \tag{12.63}$$

Here, use has been made of Equations (12.9)-(12.11). Likewise, the Earth's moment of
inertia about an axis perpendicular to its axis of rotation (and passing through the Earth's
center) is

$$I_\perp = \int (y^2 + z^2)\,\gamma\,d^3\mathbf{r} = \int r^2\,\gamma\,(\sin^2\theta\,\sin^2\phi + \cos^2\theta)\,d^3\mathbf{r}
= \int r^2\,\gamma\left(\frac{1}{2}\sin^2\theta + \cos^2\theta\right)d^3\mathbf{r} = \frac{1}{2}\int r^2\,\gamma\,(1 + \cos^2\theta)\,d^3\mathbf{r}, \tag{12.64}$$

since the average of $\sin^2\phi$ is $1/2$ for an axisymmetric mass distribution. It follows from
the above three equations that

$$\epsilon = \frac{I_\parallel - I_\perp}{I_0}. \tag{12.65}$$

This result, which is known as McCullough's formula, demonstrates that the Earth's
ellipticity is directly related to the difference between its principal moments of inertia. It turns
out that McCullough's formula holds for any axially symmetric mass distribution, and not
just a spheroidal distribution with uniform density. Finally, McCullough's formula can be
combined with Equation (12.49) to give

$$\Phi(r,\theta) = -\frac{G\,M}{r} + \frac{G\,(I_\parallel - I_\perp)}{r^3}\,P_2(\cos\theta). \tag{12.66}$$
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Figure 12.3: Two orbiting masses. 

This is the general expression for the gravitational potential generated outside an axially 
symmetric mass distribution. The first term on the right-hand side is the monopole gravi- 
tational field which would be generated if all of the mass in the distribution were concen- 
trated at its center of mass, whereas the second term is the quadrupole field generated by 
any deviation from spherical symmetry in the distribution. 
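McCullough's formula (12.65) can be tested directly for the uniform spheroid of Equation (12.41). The sketch below performs the radial integrals analytically and the polar-angle integrals by Gauss-Legendre quadrature in cos(theta), then forms the ratio of the difference of the two moments of inertia to (2/5) M a^2. Because (12.65) was derived to first order in the ellipticity, the ratio is only expected to match it up to corrections of relative size of order the ellipticity itself. The function name and default arguments are hypothetical.

```python
import numpy as np
from numpy.polynomial.legendre import leggauss

def mccullough_ratio(eps, a=1.0, gamma=1.0, order=64):
    """(I_parallel - I_perp) / I_0 for a uniform spheroid with boundary (12.41)."""
    c, w = leggauss(order)                      # c = cos(theta), weights for d(cos theta)
    P2 = 0.5 * (3.0 * c * c - 1.0)
    a_eps = a * (1.0 - (2.0 / 3.0) * eps * P2)  # surface radius, Equation (12.41)
    # Radial integrals done analytically; the factor 2 pi comes from the azimuthal angle.
    M = 2.0 * np.pi * gamma * np.sum(w * a_eps ** 3 / 3.0)
    I_par = 2.0 * np.pi * gamma * np.sum(w * a_eps ** 5 / 5.0 * (1.0 - c * c))
    I_perp = 2.0 * np.pi * gamma * np.sum(w * a_eps ** 5 / 5.0 * 0.5 * (1.0 + c * c))
    I0 = 0.4 * M * a * a                        # moment of inertia of the equivalent sphere
    return (I_par - I_perp) / I0

for eps in (0.01, -0.01, 0.001):
    print(eps, mccullough_ratio(eps))           # ratio should be close to eps
```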



12.8 Tidal Elongation 

Consider two point masses, $m$ and $m'$, executing circular orbits about their common center
of mass, $C$, with angular velocity $\omega$. Let $R$ be the distance between the masses, and $\rho$ the
distance between point $C$ and mass $m$ (see Figure 12.3). We know, from Section 6.3, that

$$\omega^2 = \frac{G\,M}{R^3}, \tag{12.67}$$

and

$$\rho = \frac{m'}{M}\,R, \tag{12.68}$$

where $M = m + m'$.

Let us transform to a non-inertial frame of reference which rotates, about an axis
perpendicular to the orbital plane and passing through $C$, at the angular velocity $\omega$. In this
reference frame, both masses appear to be stationary. Consider mass $m$. In the rotating
frame, this mass experiences a gravitational acceleration

$$a_g = \frac{G\,m'}{R^2} \tag{12.69}$$

directed toward the center of mass, and a centrifugal acceleration (see Chapter 7)

$$a_c = \omega^2\,\rho \tag{12.70}$$

directed away from the center of mass. However, it is easily demonstrated, using
Equations (12.67) and (12.68), that

$$a_c = a_g. \tag{12.71}$$

In other words, the gravitational and centrifugal accelerations balance, as must be the case
if mass $m$ is to remain stationary in the rotating frame. Let us investigate how this balance
is affected if the masses $m$ and $m'$ have finite spatial extents.
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Figure 12.4: Calculation of tidal forces. 



Let the center of the mass distribution $m'$ lie at $A$, the center of the mass distribution $m$
at $B$, and the center of mass at $C$ (see Figure 12.4). We wish to calculate the centrifugal and
gravitational accelerations at some point $D$ in the vicinity of point $B$. It is convenient to
adopt spherical coordinates, centered on point $B$, and aligned such that the z-axis coincides
with the line $BA$.

Let us assume that the mass distribution $m$ is orbiting around $C$, but is not rotating
about an axis passing through its center, in order to exclude rotational flattening from
our analysis. If this is the case then it is easily seen that each constituent point of $m$
executes circular motion of angular velocity $\omega$ and radius $\rho$ (see Figure 12.5). Hence, each
constituent point experiences the same centrifugal acceleration: i.e.,

$$\mathbf{g}_c = -\omega^2\,\rho\,\mathbf{e}_z. \tag{12.72}$$

It follows that

$$\mathbf{g}_c = -\nabla\chi, \tag{12.73}$$

where

$$\chi = \omega^2\,\rho\,z \tag{12.74}$$

is the centrifugal potential, and $z = r\cos\theta$. The centrifugal potential can also be written

$$\chi = \frac{G\,m'}{R}\,\frac{r}{R}\,P_1(\cos\theta). \tag{12.75}$$

The gravitational acceleration at point $D$ due to mass $m'$ is given by

$$\mathbf{g}_g = -\nabla\Phi', \tag{12.76}$$

where the gravitational potential takes the form

$$\Phi' = -\frac{G\,m'}{R'}. \tag{12.77}$$

Here, $R'$ is the distance between points $A$ and $D$. Note that the gravitational potential
generated by the mass distribution $m'$ is the same as that generated by an equivalent point
mass at $A$, as long as the distribution is spherically symmetric, which we shall assume to
be the case.

Figure 12.5: The center $B$ of the mass distribution $m$ orbits about the center of mass $C$ in a
circle of radius $\rho$. If the mass distribution is non-rotating then a non-central point $D$ must
maintain a constant spatial relationship to $B$. It follows that point $D$ orbits some point $C'$,
which has the same spatial relationship to $C$ that $D$ has to $B$, in a circle of radius $\rho$.



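Now,

$$\mathbf{R}' = \mathbf{R} - \mathbf{r}, \tag{12.78}$$

where $\mathbf{R}'$ is the vector $\overrightarrow{DA}$, and $\mathbf{R}$ the vector $\overrightarrow{BA}$ (see Figure 12.4). It follows that

$$R'^{-1} = (R^2 - 2\,\mathbf{R}\cdot\mathbf{r} + r^2)^{-1/2} = (R^2 - 2\,R\,r\cos\theta + r^2)^{-1/2}. \tag{12.79}$$

Expanding in powers of $r/R$, we obtain

$$R'^{-1} = \frac{1}{R}\sum_{n=0,\infty}\left(\frac{r}{R}\right)^n P_n(\cos\theta). \tag{12.80}$$

Hence,

$$\Phi' \simeq -\frac{G\,m'}{R}\left[1 + \frac{r}{R}\,P_1(\cos\theta) + \frac{r^2}{R^2}\,P_2(\cos\theta)\right] \tag{12.81}$$

to second-order in $r/R$.

Adding $\chi$ and $\Phi'$, we obtain

$$\chi + \Phi' \simeq -\frac{G\,m'}{R}\left[1 + \frac{r^2}{R^2}\,P_2(\cos\theta)\right] \tag{12.82}$$

to second-order in $r/R$. Note that $\chi + \Phi'$ is the potential due to the net external force acting
on the mass distribution $m$. This potential is constant up to first-order in $r/R$, because the
first-order variations in $\chi$ and $\Phi'$ cancel one another. The cancellation is a manifestation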
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of the balance between the centrifugal and gravitational accelerations in the equivalent 
point mass problem discussed above. However, this balance is only exact at the center 
of the mass distribution m. Away from the center, the centrifugal acceleration remains 
constant, whereas the gravitational acceleration increases with increasing z. Hence, at 
positive z, the gravitational acceleration is larger than the centrifugal, giving rise to a 
net acceleration in the +z-direction. Likewise, at negative z, the centrifugal acceleration 
is larger than the gravitational, giving rise to a net acceleration in the -z-direction. It
follows that the mass distribution m is subject to a residual acceleration, represented by 
the second-order variation in Equation (12.82), which acts to elongate it along the z-axis. 
This effect is known as tidal elongation. 
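The cancellation of the first-order terms, and the accuracy of the second-order form (12.82), can be examined numerically. The sketch below compares the exact sum of the centrifugal potential (12.74) and the point-mass potential (12.77) with the truncated expression (12.82), after subtracting the common constant. The Earth-Moon numbers used are those quoted later in this section, the value of G is the standard one (an assumption), and the function names are hypothetical.

```python
import math

G = 6.674e-11  # m^3 kg^-1 s^-2 (assumed standard value)

def external_potential_exact(r, theta, m_prime, R):
    """chi + Phi' from Equations (12.74) and (12.77), using omega^2 rho = G m'/R^2."""
    chi = G * m_prime / R ** 2 * r * math.cos(theta)
    R_prime = math.sqrt(R * R - 2.0 * R * r * math.cos(theta) + r * r)   # Eq. (12.79)
    return chi - G * m_prime / R_prime

def external_potential_approx(r, theta, m_prime, R):
    """Second-order expression, Equation (12.82)."""
    P2 = 0.5 * (3.0 * math.cos(theta) ** 2 - 1.0)
    return -G * m_prime / R * (1.0 + (r / R) ** 2 * P2)

m_moon, R, a = 7.35e22, 3.84e8, 6.37e6    # kg, m, m (values quoted in this section)
const = -G * m_moon / R                   # constant part common to both expressions

for theta in (0.0, math.pi / 2.0, math.pi):
    exact = external_potential_exact(a, theta, m_moon, R) - const
    approx = external_potential_approx(a, theta, m_moon, R) - const
    print(theta, exact, approx)           # agreement to within a few per cent
```

The residual disagreement is of relative order r/R, which is the size of the neglected third-order term in the expansion (12.80).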

In order to calculate the tidal elongation of the mass distribution $m$ we need to add the
potential, $\chi + \Phi'$, due to the external forces, to the gravitational potential, $\Phi$, generated
by the distribution itself. Assuming that the mass distribution is spheroidal with mass $m$,
mean radius $a$, and ellipticity $\epsilon$, it follows from Equations (12.41), (12.51), and (12.82)
that the total surface potential is given by

$$\chi + \Phi' + \Phi \simeq -\frac{G\,m}{a} - \frac{G\,m'}{R} - \frac{4}{15}\,\frac{G\,m}{a}\,\epsilon\,P_2(\cos\theta) - \frac{G\,m'\,a^2}{R^3}\,P_2(\cos\theta), \tag{12.83}$$

where we have treated $\epsilon$ and $a/R$ as small quantities. As before, the condition for
equilibrium is that the total potential be constant over the surface of the spheroid. Hence, we
obtain

$$\epsilon = -\frac{15}{4}\,\frac{m'}{m}\left(\frac{a}{R}\right)^3 \tag{12.84}$$

as our prediction for the ellipticity induced in a self-gravitating spherical mass distribution
of total mass $m$ and radius $a$ by a second mass, $m'$, which is in a circular orbit of radius $R$
about the distribution. Thus, if $a_+$ is the maximum radius of the distribution, and $a_-$ the
minimum radius (see Figure 12.6), then

$$\frac{a_+ - a_-}{a} = -\epsilon = \frac{15}{4}\,\frac{m'}{m}\left(\frac{a}{R}\right)^3. \tag{12.85}$$
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Consider the tidal elongation of the Earth due to the Moon. In this case, we have 
$a = 6.37\times 10^{6}$ m, $R = 3.84\times 10^{8}$ m, $m = 5.97\times 10^{24}$ kg, and $m' = 7.35\times 10^{22}$ kg. Hence,
we calculate that $-\epsilon = 2.1\times 10^{-7}$, or

$$\Delta a = a_+ - a_- = -\epsilon\,a = 1.34\ {\rm m}. \tag{12.86}$$

We, thus, predict that tidal forces due to the Moon cause the Earth to elongate along the
axis joining its center to the Moon by about 1.3 meters. Since water is obviously more fluid
than rock (especially on relatively short time-scales) most of this elongation takes place in
the oceans rather than in the underlying land. Hence, the oceans rise, relative to the land,
in the region of the Earth closest to the Moon, and also in the region furthest away. Since
the Earth is rotating, whilst the tidal bulge of the oceans remains relatively stationary, the
Moon's tidal force causes the ocean at a given point on the Earth's surface to rise and fall,
by about a meter, twice daily, giving rise to the phenomenon known as the tides.

Consider the tidal elongation of the Earth due to the Sun. In this case, we have $a =
6.37\times 10^{6}$ m, $R = 1.50\times 10^{11}$ m, $m = 5.97\times 10^{24}$ kg, and $m' = 1.99\times 10^{30}$ kg. Hence, we
calculate that $-\epsilon = 9.6\times 10^{-8}$, or

$$\Delta a = a_+ - a_- = -\epsilon\,a = 0.61\ {\rm m}. \tag{12.87}$$

Thus, the tidal elongation due to the Sun is about half that due to the Moon. It follows that
the tides are particularly high when the Sun, the Earth, and the Moon lie approximately in
a straight line, so that the tidal effects of the Sun and the Moon reinforce one another. This
occurs at a new moon, or at a full moon. These types of tides are called spring tides (note
that the name has nothing to do with the season). Conversely, the tides are particularly
low when the Sun, the Earth, and the Moon form a right-angle, so that the tidal effects
of the Sun and the Moon partially cancel one another. These types of tides are called neap
tides. Generally speaking, we would expect two spring tides and two neap tides per month.

In reality, the amplitude of the tides varies significantly from place to place on the
Earth's surface, due to the presence of the continents, which impede the flow of the oceanic
tidal bulge around the Earth. Moreover, there is a time-lag of approximately 12 minutes
between the Moon being directly overhead (or directly below) and high tide, because of
the finite inertia of the oceans. Similarly, the time-lag between a spring tide and a full
moon, or a new moon, can be up to 2 days.

12.9 Roche Radius

Consider a spherical moon of mass $m$ and radius $a$ which is in a circular orbit of radius $R$
about a spherical planet of mass $m'$ and radius $a'$. (Strictly speaking, the moon and the
planet execute circular orbits about their common center of mass. However, if the planet
is much more massive than the moon then the center of mass lies close to the planet's
center.) According to the analysis in the previous section, a constituent element of the
moon experiences a force per unit mass, due to the gravitational field of the planet, which
takes the form

$$\mathbf{g}' = -\nabla(\chi + \Phi'), \tag{12.88}$$

where

$$\chi + \Phi' = -\frac{G\,m'}{R^3}\,(z^2 - x^2/2 - y^2/2) + {\rm const}. \tag{12.89}$$

Here, $(x, y, z)$ is a Cartesian coordinate system whose origin is the center of the moon,
and whose z-axis always points toward the center of the planet. It follows that

$$\mathbf{g}' = \frac{2\,G\,m'}{R^3}\left(-\frac{x}{2}\,\mathbf{e}_x - \frac{y}{2}\,\mathbf{e}_y + z\,\mathbf{e}_z\right). \tag{12.90}$$

This so-called tidal force is generated by the spatial variation of the planet's gravitational
field over the interior of the moon, and acts to elongate the moon along an axis joining
its center to that of the planet, and to compress it in any direction perpendicular to this
axis. Note that the magnitude of the tidal force increases strongly as the radius, $R$, of
the moon's orbit decreases. Now, if the tidal force becomes sufficiently strong then it can
overcome the moon's self-gravity and thereby rip the moon apart. It follows that there is
a minimum radius, generally referred to as the Roche radius, at which a moon can orbit a
planet without being destroyed by tidal forces.
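To get a feel for the magnitudes involved, the sketch below evaluates the on-axis tidal acceleration of Equation (12.90), and the tidal elongation implied by Equation (12.84), for the Earth under the influence of the Moon and of the Sun. The masses and distances are those quoted in Section 12.8, the value of G is the standard one (an assumption), and the function names are hypothetical.

```python
G = 6.674e-11  # m^3 kg^-1 s^-2 (assumed standard value)

def tidal_acceleration(m_prime, R, z):
    """z-component of Equation (12.90) on the axis x = y = 0."""
    return 2.0 * G * m_prime * z / R ** 3

def tidal_elongation(m, m_prime, a, R):
    """a_+ - a_- = -epsilon a, with epsilon taken from Equation (12.84)."""
    eps = -3.75 * (m_prime / m) * (a / R) ** 3
    return -eps * a

m_earth, a_earth = 5.97e24, 6.37e6            # kg, m
moon = dict(m_prime=7.35e22, R=3.84e8)        # kg, m
sun = dict(m_prime=1.99e30, R=1.50e11)        # kg, m

for name, body in (("Moon", moon), ("Sun", sun)):
    accel = tidal_acceleration(body["m_prime"], body["R"], a_earth)
    stretch = tidal_elongation(m_earth, body["m_prime"], a_earth, body["R"])
    print(name, accel, "m/s^2", stretch, "m")  # compare stretch with (12.86), (12.87)
```

Both tidal accelerations come out many orders of magnitude smaller than the Earth's surface gravity, which is why the resulting elongations are only of order a meter.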

Let us derive an expression for the Roche radius. Consider a small mass element at the
point on the surface of the moon which lies closest to the planet, and at which the tidal
force is consequently largest (i.e., $x = y = 0$, $z = a$). According to Equation (12.90),
the mass experiences an upward (from the moon's surface) tidal acceleration due to the
gravitational attraction of the planet of the form

$$\mathbf{g}' = \frac{2\,G\,m'\,a}{R^3}\,\mathbf{e}_z. \tag{12.91}$$

The mass also experiences a downward gravitational acceleration due to the gravitational
influence of the moon which is written

$$\mathbf{g} = -\frac{G\,m}{a^2}\,\mathbf{e}_z. \tag{12.92}$$

Thus, the effective surface gravity at the point in question is

$$g_{\rm eff} = \frac{G\,m}{a^2}\left(1 - 2\,\frac{m'}{m}\,\frac{a^3}{R^3}\right). \tag{12.93}$$

Note that if $R < R_c$, where

$$R_c = \left(2\,\frac{m'}{m}\right)^{1/3} a, \tag{12.94}$$

then the effective gravity is negative. In other words, the tidal force due to the planet is
strong enough to overcome surface gravity and lift objects off the moon's surface. If this
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is the case, and the tensile strength of the moon is negligible, then it is fairly clear that 
the tidal force will start to break the moon apart. Hence, $R_c$ is the Roche radius. Now,
$m'/m = (\rho'/\rho)\,(a'/a)^3$, where $\rho$ and $\rho'$ are the mean mass densities of the moon and
planet, respectively. Thus, the above expression for the Roche radius can also be written

$$R_c = \left(2\,\frac{\rho'}{\rho}\right)^{1/3} a' = 1.26\left(\frac{\rho'}{\rho}\right)^{1/3} a'. \tag{12.95}$$

The above calculation is somewhat inaccurate, since it fails to take into account the
inevitable distortion of the moon's shape in the presence of strong tidal forces. (In fact, the
calculation assumes that the moon always remains spherical.) A more accurate calculation,
which treats the moon as a self-gravitating incompressible fluid, yields

$$R_c = 2.44\left(\frac{\rho'}{\rho}\right)^{1/3} a'. \tag{12.96}$$

It follows that if the planet and the moon have the same mean density then the Roche 
radius is 2.44 times the planet's radius. Note that small orbital bodies such as rocks, or 
even very small moons, can survive intact within the Roche radius because they are held 
together by internal tensile forces rather than gravitational attraction. However, this mech- 
anism becomes progressively less effective as the size of the body in question increases. Not 
surprisingly, virtually all large planetary moons occurring in the Solar System have orbital 
radii which exceed the relevant Roche radius, whereas virtually all planetary ring systems 
(which consist of myriads of small orbiting rocks) have radii which lie inside the relevant 
Roche radius. 
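
The two estimates of the Roche radius discussed above can be compared numerically. The following is a minimal sketch, assuming only the rigid-sphere coefficient $2^{1/3}$ and the fluid-body coefficient 2.44 quoted in the text; the function name `roche_radius_ratio` is a hypothetical helper introduced for illustration.

```python
def roche_radius_ratio(density_ratio: float, fluid: bool = False) -> float:
    """Return R_c / a' (Roche radius in planetary radii).

    density_ratio is rho'/rho, the planet's mean density divided by the moon's.
    fluid=False uses the rigid-sphere coefficient 2**(1/3);
    fluid=True uses the coefficient 2.44 quoted for an incompressible fluid moon.
    """
    coefficient = 2.44 if fluid else 2.0 ** (1.0 / 3.0)
    return coefficient * density_ratio ** (1.0 / 3.0)


rigid = roche_radius_ratio(1.0)
fluid = roche_radius_ratio(1.0, fluid=True)
print(f"equal densities: rigid {rigid:.2f}, fluid {fluid:.2f} planetary radii")
```

For equal densities the fluid estimate is nearly twice the rigid one, which shows how strongly the result depends on allowing the moon to deform.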

12.10 Precession and Forced Nutation of the Earth 

Consider the Earth-Sun system (see Figure 12.7). From a geocentric viewpoint, the Sun 
orbits the Earth counter-clockwise (looking from the north), once per year, in an 
approximately circular orbit of radius $a_s = 1.50\times 10^{11}\,{\rm m}$. In astronomy, the plane of the Sun's 
apparent orbit relative to the Earth is known as the ecliptic plane. Let us define non-rotating 
Cartesian coordinates, centered on the Earth, which are such that the $x$- and $y$-axes lie in 
the ecliptic plane, and the $z$-axis is normal to this plane (in the sense that the Earth's north 
pole lies at positive $z$). It follows that the $z$-axis is directed toward a point in the sky 
(located in the constellation Draco) known as the north ecliptic pole. In the following, we 
shall treat the $Oxyz$ system as inertial. This is a reasonable approximation because the 
orbital acceleration of the Earth is much smaller than the acceleration due to its diurnal 
rotation. It is convenient to parameterize the instantaneous position of the Sun in terms 
of a counter-clockwise (looking from the north) azimuthal angle $\lambda_s$ that is zero on the 
positive $x$-axis (see Figure 12.7). 

Figure 12.7: The Earth-Sun system. 

Let $\boldsymbol{\omega}$ be the Earth's angular velocity vector due to its daily rotation. This vector makes 
an angle $\theta$ with the $z$-axis, where $\theta = 23.44^\circ$ is the mean inclination of the ecliptic to the 
Earth's equatorial plane. Suppose that the projection of $\boldsymbol{\omega}$ onto the ecliptic plane subtends 
an angle $\phi$ with the $y$-axis, where $\phi$ is measured in a counter-clockwise (looking from the 
north) sense (see Figure 12.7). The orientation of the Earth's axis of rotation (which is, of 
course, parallel to $\boldsymbol{\omega}$) is thus determined by the two angles $\theta$ and $\phi$. Note, however, that 
these two angles are also Euler angles, in the sense given in Chapter 8. Let us examine 
the Earth-Sun system at an instant in time, $t = 0$, when $\phi = 0$: i.e., when $\boldsymbol{\omega}$ lies in the 
$y$-$z$ plane. At this particular instant, the $x$-axis points towards the so-called vernal equinox, 
which is defined as the point in the sky where the ecliptic plane crosses the projection of 
the Earth's equator (i.e., the plane normal to $\boldsymbol{\omega}$) from south to north. A counter-clockwise 
(looking from the north) angle in the ecliptic plane that is zero at the vernal equinox is 
generally known as an ecliptic longitude. Thus, $\lambda_s$ is the Sun's ecliptic longitude. 

According to Equation (12.66), the potential energy of the Earth-Sun system is written 

$$ U = M_s\,\Phi = -\frac{G\,M_s\,M}{a_s} + \frac{G\,M_s\,(I_\parallel - I_\perp)}{a_s^{\,3}}\,P_2[\cos(\gamma_s)], \tag{12.97} $$

where $M_s$ is the mass of the Sun, $M$ the mass of the Earth, $I_\parallel$ the Earth's moment of 
inertia about its axis of rotation, $I_\perp$ the Earth's moment of inertia about an axis lying in 
its equatorial plane, and $P_2(x) = (1/2)\,(3\,x^2 - 1)$. Furthermore, $\gamma_s$ is the angle subtended 
between $\boldsymbol{\omega}$ and $\mathbf{r}_s$, where $\mathbf{r}_s$ is the position vector of the Sun relative to the Earth. 
It is easily demonstrated that (with $\phi = 0$) 

$$ \boldsymbol{\omega} = \omega\,(0,\ \sin\theta,\ \cos\theta), \tag{12.98} $$

and 

$$ \mathbf{r}_s = a_s\,(\cos\lambda_s,\ \sin\lambda_s,\ 0). \tag{12.99} $$

Hence, 

$$ \cos\gamma_s = \frac{\boldsymbol{\omega}\cdot\mathbf{r}_s}{|\boldsymbol{\omega}|\,|\mathbf{r}_s|} = \sin\theta\,\sin\lambda_s, \tag{12.100} $$

giving 

$$ U = -\frac{G\,M_s\,M}{a_s} + \frac{G\,M_s\,(I_\parallel - I_\perp)}{2\,a_s^{\,3}}\left(3\,\sin^2\theta\,\sin^2\lambda_s - 1\right). \tag{12.101} $$

Now, we are primarily interested in the motion of the Earth's axis of rotation over time- 
scales that are much longer than a year, so we can average the above expression over the 
Sun's orbit to give 

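$$ U = -\frac{G\,M_s\,M}{a_s} + \frac{G\,M_s\,(I_\parallel - I_\perp)}{2\,a_s^{\,3}}\left(\frac{3}{2}\,\sin^2\theta - 1\right) \tag{12.102} $$

(since the average of $\sin^2\lambda_s$ over a year is $1/2$). Thus, we obtain 

$$ U = U_0 - \epsilon\,\alpha_s\,\cos(2\,\theta), \tag{12.103} $$

where $U_0$ is a constant, and 

$$ \alpha_s = \frac{3}{8}\,I_\parallel\,n_s^{\,2}. \tag{12.104} $$

Here, 

$$ \epsilon = \frac{I_\parallel - I_\perp}{I_\parallel} = 0.00335 \tag{12.105} $$

is the Earth's ellipticity, and 

$$ n_s = \frac{d\lambda_s}{dt} = \left(\frac{G\,M_s}{a_s^{\,3}}\right)^{1/2} \tag{12.106} $$

the Sun's apparent orbital angular velocity. 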

According to Section 8.9, the rotational kinetic energy of the Earth can be written 

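$$ K = \frac{1}{2}\left(I_\perp\,\dot{\theta}^{\,2} + I_\perp\,\sin^2\theta\,\dot{\phi}^{\,2} + I_\parallel\,\omega^2\right), \tag{12.107} $$

where the Earth's angular velocity 

$$ \omega = \cos\theta\,\dot{\phi} + \dot{\psi} \tag{12.108} $$

is a constant of the motion. Here, $\psi$ is the third Euler angle. Hence, the Earth's Lagrangian 
takes the form 

$$ \mathcal{L} = K - U = \frac{1}{2}\left(I_\perp\,\dot{\theta}^{\,2} + I_\perp\,\sin^2\theta\,\dot{\phi}^{\,2} + I_\parallel\,\omega^2\right) + \epsilon\,\alpha_s\,\cos(2\,\theta), \tag{12.109} $$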



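where any constant terms have been neglected. One equation of motion which can 
immediately be derived from this Lagrangian is 

$$ \frac{d}{dt}\left(\frac{\partial\mathcal{L}}{\partial\dot{\theta}}\right) - \frac{\partial\mathcal{L}}{\partial\theta} = 0, \tag{12.110} $$

which reduces to 

$$ I_\perp\,\ddot{\theta} - \frac{\partial\mathcal{L}}{\partial\theta} = 0. \tag{12.111} $$

Consider steady precession of the Earth's rotational axis, which is characterized by $\dot{\theta} = 0$, 
with both $\dot{\phi}$ and $\dot{\psi}$ constant. It follows, from the above equation, that such motion must 
satisfy the constraint 

$$ \frac{\partial\mathcal{L}}{\partial\theta} = 0. \tag{12.112} $$

Thus, we obtain 

$$ \frac{1}{2}\,I_\perp\,\sin(2\,\theta)\,\dot{\phi}^{\,2} - I_\parallel\,\sin\theta\,\omega\,\dot{\phi} - 2\,\epsilon\,\alpha_s\,\sin(2\,\theta) = 0, \tag{12.113} $$

where use has been made of Equations (12.108) and (12.109). Now, as can easily be 
verified after the fact, $|\dot{\phi}| \ll \omega$, so the above equation reduces to 

$$ \dot{\phi} \simeq -\frac{4\,\epsilon\,\alpha_s\,\cos\theta}{I_\parallel\,\omega} = -\Omega_\phi, \tag{12.114} $$

which can be integrated to give 

$$ \phi \simeq -\Omega_\phi\,t, \tag{12.115} $$

where 

$$ \Omega_\phi = \frac{3}{2}\,\frac{\epsilon\,n_s^{\,2}}{\omega}\,\cos\theta, \tag{12.116} $$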

and use has been made of Equation (12.104). According to the above expression, the 
mutual interaction between the Sun and the quadrupole gravitational field generated by 
the Earth's slight oblateness causes the Earth's axis of rotation to precess steadily about the 
normal to the ecliptic plane at the rate $-\Omega_\phi$. The fact that $-\Omega_\phi$ is negative implies that the 
precession is in the opposite direction to the direction of the Earth's rotation and the Sun's 
apparent orbit about the Earth. Incidentally, the interaction causes a precession of the 
Earth's rotational axis, rather than the plane of the Sun's orbit, because the Earth's axial 
moment of inertia is much less than the Sun's orbital moment of inertia. The precession 
period in years is given by 

$$ T_\phi({\rm yr}) = \frac{n_s}{\Omega_\phi} = \frac{2\,T_s({\rm day})}{3\,\epsilon\,\cos\theta}, \tag{12.117} $$

where $T_s({\rm day}) = \omega/n_s = 365.24$ is the Sun's orbital period in days. Thus, given that 
$\epsilon = 0.00335$ and $\theta = 23.44^\circ$, we obtain 

$$ T_\phi \simeq 79{,}200\ {\rm years}. \tag{12.118} $$
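
The Sun-only estimate can be checked with a few lines of arithmetic. This is a minimal sketch that evaluates $T_\phi({\rm yr}) = 2\,T_s({\rm day})/(3\,\epsilon\,\cos\theta)$ with the values quoted in the text; the function name is a hypothetical helper, not part of the original derivation.

```python
import math


def solar_precession_period_years(ellipticity, obliquity_deg, days_per_year):
    """Precession period in years when only the Sun's torque is included."""
    theta = math.radians(obliquity_deg)
    return 2.0 * days_per_year / (3.0 * ellipticity * math.cos(theta))


period = solar_precession_period_years(0.00335, 23.44, 365.24)
print(f"Sun-only precession period: {period:,.0f} years")
```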

Unfortunately, the observed precession period of the Earth's axis of rotation about the 
normal to the ecliptic plane is approximately 25,800 years, so something is clearly missing 
from our model. It turns out that the missing factor is the influence of the Moon. 

Using analogous arguments to those given above, the potential energy of the Earth-Moon 
system can be written 

$$ U = -\frac{G\,M_m\,M}{a_m} + \frac{G\,M_m\,(I_\parallel - I_\perp)}{a_m^{\,3}}\,P_2[\cos(\gamma_m)], \tag{12.119} $$

where $M_m$ is the lunar mass, and $a_m$ the radius of the Moon's (approximately circular) 
orbit. Furthermore, $\gamma_m$ is the angle subtended between $\boldsymbol{\omega}$ and $\mathbf{r}_m$, where 

$$ \boldsymbol{\omega} = \omega\,(-\sin\theta\,\sin\phi,\ \sin\theta\,\cos\phi,\ \cos\theta) \tag{12.120} $$

is the Earth's angular velocity vector, and $\mathbf{r}_m$ is the position vector of the Moon relative to 
the Earth. Here, for the moment, we have retained the $\phi$ dependence in our expression 
for $\boldsymbol{\omega}$ (since we shall presently differentiate by $\phi$, before setting $\phi = 0$). Now, the Moon's 
orbital plane is actually slightly inclined to the ecliptic plane, the angle of inclination being 
$\iota_m = 5.16^\circ$. Hence, we can write 

$$ \mathbf{r}_m \simeq a_m\,\left(\cos\lambda_m,\ \sin\lambda_m,\ \iota_m\,\sin(\lambda_m - \varpi_n)\right), \tag{12.121} $$

to first order in $\iota_m$, where $\lambda_m$ is the Moon's ecliptic longitude, and $\varpi_n$ is the ecliptic 
longitude of the lunar ascending node, which is defined as the point on the lunar orbit where 
the Moon crosses the ecliptic plane from south to north. Of course, $\lambda_m$ increases at the rate 
$n_m$, where 

$$ n_m = \frac{d\lambda_m}{dt} \simeq \left(\frac{G\,M}{a_m^{\,3}}\right)^{1/2} \tag{12.122} $$

is the Moon's orbital angular velocity. It turns out that the lunar ascending node precesses 
steadily, in the opposite direction to the Moon's orbital rotation, in such a manner that 
it completes a full circuit every 18.6 years. This precession is caused by the perturbing 
influence of the Sun (see Chapter 14). It follows that 

$$ \frac{d\varpi_n}{dt} = -\Omega_n, \tag{12.123} $$

where $2\pi/\Omega_n = 18.6$ years. Now, from (12.120) and (12.121), 

$$ \cos\gamma_m = \frac{\boldsymbol{\omega}\cdot\mathbf{r}_m}{|\boldsymbol{\omega}|\,|\mathbf{r}_m|} = \sin\theta\,\sin(\lambda_m - \phi) + \iota_m\,\cos\theta\,\sin(\lambda_m - \varpi_n), \tag{12.124} $$

so (12.119) yields 

$$ \begin{aligned} U \simeq{} & -\frac{G\,M_m\,M}{a_m} + \frac{G\,M_m\,(I_\parallel - I_\perp)}{2\,a_m^{\,3}}\left[3\,\sin^2\theta\,\sin^2(\lambda_m - \phi)\right. \\ & \left. {}+ 3\,\iota_m\,\sin(2\,\theta)\,\sin(\lambda_m - \phi)\,\sin(\lambda_m - \varpi_n) - 1\right] \end{aligned} \tag{12.125} $$

to first order in $\iota_m$. Given that we are interested in the motion of the Earth's axis of rotation 
on time-scales that are much longer than a month, we can average the above expression 
over the Moon's orbit to give 

$$ U \simeq U_0' - \epsilon\,\alpha_m\,\cos(2\,\theta) + \epsilon\,\beta_m\,\sin(2\,\theta)\,\cos(\varpi_n - \phi), \tag{12.126} $$

[since the average of $\sin^2(\lambda_m - \phi)$ over a month is $1/2$, whereas that of 
$\sin(\lambda_m - \phi)\,\sin(\lambda_m - \varpi_n)$ is $(1/2)\,\cos(\varpi_n - \phi)$]. Here, $U_0'$ is a constant, 

$$ \alpha_m = \frac{3}{8}\,I_\parallel\,\mu_m\,n_m^{\,2}, \tag{12.127} $$

$$ \beta_m = \frac{3}{4}\,I_\parallel\,\iota_m\,\mu_m\,n_m^{\,2}, \tag{12.128} $$

and 

$$ \mu_m = \frac{M_m}{M} = 0.0123 \tag{12.129} $$

is the ratio of the lunar to the terrestrial mass. Now, gravity is a superposable force, so the 
total potential energy of the Earth-Moon-Sun system is the sum of Equations (12.103) and 
(12.126). In other words, 

$$ U = U_0'' - \epsilon\,\alpha\,\cos(2\,\theta) + \epsilon\,\beta_m\,\sin(2\,\theta)\,\cos(\varpi_n - \phi), \tag{12.130} $$

where $U_0''$ is a constant, and 

$$ \alpha = \alpha_s + \alpha_m. \tag{12.131} $$
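Finally, making use of (12.107), the Lagrangian of the Earth is written 

$$ \mathcal{L} = \frac{1}{2}\left(I_\perp\,\dot{\theta}^{\,2} + I_\perp\,\sin^2\theta\,\dot{\phi}^{\,2} + I_\parallel\,\omega^2\right) + \epsilon\,\alpha\,\cos(2\,\theta) - \epsilon\,\beta_m\,\sin(2\,\theta)\,\cos(\varpi_n - \phi), \tag{12.132} $$

where any constant terms have been neglected. Recall that $\omega$ is given by (12.108), and is 
a constant of the motion. 

Two equations of motion that can immediately be derived from the above Lagrangian 
are 

$$ \frac{d}{dt}\left(\frac{\partial\mathcal{L}}{\partial\dot{\theta}}\right) - \frac{\partial\mathcal{L}}{\partial\theta} = 0, \tag{12.133} $$

$$ \frac{d}{dt}\left(\frac{\partial\mathcal{L}}{\partial\dot{\phi}}\right) - \frac{\partial\mathcal{L}}{\partial\phi} = 0. \tag{12.134} $$

(The third equation, involving $\psi$, merely confirms that $\omega$ is a constant of the motion.) The 
above two equations yield 

$$ \begin{aligned} 0 ={} & I_\perp\,\ddot{\theta} - \frac{1}{2}\,I_\perp\,\sin(2\,\theta)\,\dot{\phi}^{\,2} + I_\parallel\,\sin\theta\,\omega\,\dot{\phi} + 2\,\epsilon\,\alpha\,\sin(2\,\theta) \\ & {}+ 2\,\epsilon\,\beta_m\,\cos(2\,\theta)\,\cos(\varpi_n - \phi), \end{aligned} \tag{12.135} $$

$$ 0 = \frac{d}{dt}\left(I_\perp\,\sin^2\theta\,\dot{\phi} + I_\parallel\,\cos\theta\,\omega\right) + \epsilon\,\beta_m\,\sin(2\,\theta)\,\sin(\varpi_n - \phi), \tag{12.136} $$

respectively. Let 

$$ \theta(t) = \theta_0 + \epsilon\,\theta_1(t), \tag{12.137} $$

$$ \phi(t) = \epsilon\,\phi_1(t), \tag{12.138} $$

where $\theta_0 = 23.44^\circ$ is the mean inclination of the ecliptic to the Earth's equatorial plane. To 
first order in $\epsilon$, Equations (12.135) and (12.136) reduce to 

$$ 0 \simeq I_\perp\,\ddot{\theta}_1 + I_\parallel\,\sin\theta_0\,\omega\,\dot{\phi}_1 + 2\,\alpha\,\sin(2\,\theta_0) + 2\,\beta_m\,\cos(2\,\theta_0)\,\cos(\Omega_n\,t), \tag{12.139} $$

$$ 0 \simeq I_\perp\,\sin^2\theta_0\,\ddot{\phi}_1 - I_\parallel\,\sin\theta_0\,\omega\,\dot{\theta}_1 - \beta_m\,\sin(2\,\theta_0)\,\sin(\Omega_n\,t), \tag{12.140} $$

respectively, where use has been made of Equation (12.123). However, as can easily be 
verified after the fact, $d/dt \ll \omega$, so we obtain 

$$ \dot{\phi}_1 \simeq -\frac{4\,\alpha\,\cos\theta_0}{I_\parallel\,\omega} - \frac{2\,\beta_m\,\cos(2\,\theta_0)}{I_\parallel\,\omega\,\sin\theta_0}\,\cos(\Omega_n\,t), \tag{12.141} $$

$$ \dot{\theta}_1 \simeq -\frac{2\,\beta_m\,\cos\theta_0}{I_\parallel\,\omega}\,\sin(\Omega_n\,t). \tag{12.142} $$

The above equations can be integrated, and then combined with Equations (12.137) and 
(12.138), to give 

$$ \phi(t) = -\Omega_\phi\,t - \delta\phi\,\sin(\Omega_n\,t), \tag{12.143} $$

$$ \theta(t) = \theta_0 + \delta\theta\,\cos(\Omega_n\,t), \tag{12.144} $$

where 

$$ \Omega_\phi = \frac{3}{2}\,\frac{\epsilon\,(n_s^{\,2} + \mu_m\,n_m^{\,2})}{\omega}\,\cos\theta_0, \tag{12.145} $$

$$ \delta\phi = \frac{3}{2}\,\frac{\epsilon\,\iota_m\,\mu_m\,n_m^{\,2}}{\omega\,\Omega_n}\,\frac{\cos(2\,\theta_0)}{\sin\theta_0}, \tag{12.146} $$

$$ \delta\theta = \frac{3}{2}\,\frac{\epsilon\,\iota_m\,\mu_m\,n_m^{\,2}}{\omega\,\Omega_n}\,\cos\theta_0. \tag{12.147} $$

Incidentally, in the above, we have assumed that the lunar ascending node coincides with 
the vernal equinox at time $t = 0$ (i.e., $\varpi_n = 0$ at $t = 0$), in accordance with our previous 
assumption that $\phi = 0$ at $t = 0$. 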

According to Equation (12.143), the combined gravitational interaction of the Sun and 
the Moon with the quadrupole field generated by the Earth's slight oblateness causes the 
Earth's axis of rotation to precess steadily about the normal to the ecliptic plane at the rate 
$-\Omega_\phi$. As before, the negative sign indicates that the precession is in the opposite direction 
to the (apparent) orbital motion of the Sun and Moon. The period of the precession in 
years is given by 

$$ T_\phi({\rm yr}) = \frac{n_s}{\Omega_\phi} = \frac{2\,T_s({\rm day})}{3\,\epsilon\,\left(1 + \mu_m/[T_m({\rm yr})]^2\right)\cos\theta_0}, \tag{12.148} $$

where $T_m({\rm yr}) = n_s/n_m = 0.081$ is the Moon's (synodic) orbital period in years. Given that 
$\epsilon = 0.00335$, $\theta_0 = 23.44^\circ$, $T_s({\rm day}) = 365.24$, and $\mu_m = 0.0123$, we obtain 

$$ T_\phi \simeq 27{,}600\ {\rm years}. \tag{12.149} $$

This prediction is fairly close to the observed precession period of 25,800 years. The main 
reason that our estimate is slightly inaccurate is that we have neglected to take into 
account the small eccentricities of the Earth's orbit around the Sun, and the Moon's orbit 
around the Earth. 
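
The size of the lunar contribution is easy to see numerically. The sketch below evaluates the combined period $2\,T_s({\rm day})/[3\,\epsilon\,(1 + \mu_m/T_m^{\,2})\cos\theta_0]$ with the values quoted in the text; the function name and the returned torque ratio are illustrative additions, not part of the original derivation.

```python
import math


def lunisolar_precession(ellipticity, obliquity_deg, days_per_year,
                         mass_ratio, lunar_period_years):
    """Return (period in years, Moon-to-Sun torque ratio).

    The ratio mu_m / T_m(yr)**2 equals mu_m * n_m**2 / n_s**2, i.e. the
    lunar term divided by the solar term in the precession rate.
    """
    theta0 = math.radians(obliquity_deg)
    moon_to_sun = mass_ratio / lunar_period_years ** 2
    period = 2.0 * days_per_year / (
        3.0 * ellipticity * (1.0 + moon_to_sun) * math.cos(theta0))
    return period, moon_to_sun


period, moon_to_sun = lunisolar_precession(0.00335, 23.44, 365.24, 0.0123, 0.081)
print(f"Moon/Sun torque ratio: {moon_to_sun:.2f}")
print(f"Lunisolar precession period: {period:,.0f} years")
```

With these inputs the Moon contributes nearly twice as much as the Sun, which is why the Sun-only model overestimates the period by almost a factor of three.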

The point in the sky toward which the Earth's axis of rotation points is known as the 
north celestial pole. Currently, this point lies within about a degree of the fairly bright 
star Polaris, which is consequently sometimes known as the north star or the pole star. It 
follows that Polaris appears to be almost stationary in the sky, always lying due north, and 
can thus be used for navigational purposes. Indeed, mariners have relied on the north star 
for many hundreds of years to determine direction at sea. Unfortunately, because of the 
precession of the Earth's axis of rotation, the north celestial pole is not a fixed point in the 
sky, but instead traces out a circle, of angular radius 23.44°, about the north ecliptic pole, 
with a period of 25,800 years. Hence, a few thousand years from now, the north celestial 
pole will no longer coincide with Polaris, and there will be no convenient way of telling 
direction from the stars. 

The projection of the ecliptic plane onto the sky is called the ecliptic, and coincides with 
the apparent path of the Sun against the backdrop of the stars. Furthermore, the projection 
of the Earth's equator onto the sky is known as the celestial equator. As has been previously 
mentioned, the ecliptic is inclined at 23.44° to the celestial equator. The two points in 
the sky at which the ecliptic crosses the celestial equator are called the equinoxes, since 
night and day are equally long when the Sun lies at these points. Thus, the Sun reaches 
the vernal equinox on about March 21st, and this traditionally marks the beginning of 
spring. Likewise, the Sun reaches the autumnal equinox on about September 22nd, and this 
traditionally marks the beginning of autumn. However, the precession of the Earth's axis of 
rotation causes the celestial equator (which is always normal to this axis) to precess in the 
sky, and thus also causes the equinoxes to precess along the ecliptic. This effect is known 
as the precession of the equinoxes. The precession is in the opposite direction to the Sun's 
apparent motion around the ecliptic, and is of magnitude 1.4° per century. Amazingly, this 
minuscule effect was discovered by the Ancient Greeks (with the help of ancient Babylonian 
observations). In about 2000 BC, when the science of astronomy originated in ancient 
Egypt and Babylonia, the vernal equinox lay in the constellation Aries. Indeed, the vernal 
equinox is still sometimes called the first point of Aries in astronomical texts. About 90 BC, 
the vernal equinox moved into the constellation Pisces, where it still remains. The equinox 
will move into the constellation Aquarius (marking the beginning of the much heralded 
"Age of Aquarius") in about 2600 AD. Incidentally, the position of the vernal equinox in the 
sky is of great significance in astronomy, since it is used as the zero of celestial longitude 
(much as Greenwich is used as the zero of terrestrial longitude). 

Equations (12.143) and (12.144) indicate that the small inclination of the lunar orbit 
to the ecliptic plane, combined with the precession of the lunar ascending node, causes 
the Earth's axis of rotation to wobble slightly. This wobble is known as nutation, and is 
superimposed on the aforementioned precession. In the absence of precession, nutation 
would cause the north celestial pole to periodically trace out a small ellipse on the sky, the 
sense of rotation being counter-clockwise. The nutation period is 18.6 years: i.e., the same 
as the precession period of the lunar ascending node. The nutation amplitudes in the polar 
and azimuthal angles $\theta$ and $\phi$ are 

$$ \delta\theta = \frac{3}{2}\,\frac{\epsilon\,\iota_m\,\mu_m\,T_n({\rm yr})}{T_s({\rm day})\,[T_m({\rm yr})]^2}\,\cos\theta_0, \tag{12.150} $$

$$ \delta\phi = \frac{3}{2}\,\frac{\epsilon\,\iota_m\,\mu_m\,T_n({\rm yr})}{T_s({\rm day})\,[T_m({\rm yr})]^2}\,\frac{\cos(2\,\theta_0)}{\sin\theta_0}, \tag{12.151} $$

respectively, where $T_n({\rm yr}) = n_s/\Omega_n = 18.6$. Given that $\epsilon = 0.00335$, $\theta_0 = 23.44^\circ$, 
$\iota_m = 5.16^\circ$, $T_s({\rm day}) = 365.24$, $T_m({\rm yr}) = 0.081$, and $\mu_m = 0.0123$, we obtain 

$$ \delta\theta = 8.2'', \tag{12.152} $$

$$ \delta\phi = 15.3''. \tag{12.153} $$

The observed nutation amplitudes are 9.2" and 17.2", respectively. Hence, our estimates 
are quite close to the mark. Any inaccuracy is mainly due to the fact that we have neglected 
to take into account the small eccentricities of the Earth's orbit around the Sun, and the 
Moon's orbit around the Earth. The nutation of the Earth was discovered in 1728 by the 
English astronomer James Bradley, and was explained theoretically about 20 years later 
by d'Alembert and L. Euler. Nutation is important because the corresponding gyration 
of the Earth's rotation axis appears to be transferred to celestial objects when they are 
viewed using terrestrial telescopes. This effect causes the celestial longitudes and latitudes 
of heavenly objects to oscillate sinusoidally by up to 20'' (i.e., about the maximum angular 
size of Saturn) with a period of 18.6 years. It is necessary to correct for this oscillation in 
order to accurately guide terrestrial telescopes to particular objects. 
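
The predicted nutation amplitudes can be reproduced from the quoted parameters. This is a minimal sketch of that arithmetic, assuming the common factor $(3/2)\,\epsilon\,\iota_m\,\mu_m\,T_n/(T_s\,T_m^{\,2})$ with $\iota_m$ in radians; the function name is a hypothetical helper.

```python
import math

ARCSEC_PER_RADIAN = math.degrees(1.0) * 3600.0


def nutation_amplitudes_arcsec(ellipticity, obliquity_deg, inclination_deg,
                               days_per_year, lunar_period_years,
                               mass_ratio, node_period_years):
    """Return (delta_theta, delta_phi) in arc seconds."""
    theta0 = math.radians(obliquity_deg)
    iota = math.radians(inclination_deg)   # lunar orbital inclination in radians
    common = (1.5 * ellipticity * iota * mass_ratio * node_period_years
              / (days_per_year * lunar_period_years ** 2))
    delta_theta = common * math.cos(theta0)
    delta_phi = common * math.cos(2.0 * theta0) / math.sin(theta0)
    return delta_theta * ARCSEC_PER_RADIAN, delta_phi * ARCSEC_PER_RADIAN


d_theta, d_phi = nutation_amplitudes_arcsec(
    0.00335, 23.44, 5.16, 365.24, 0.081, 0.0123, 18.6)
print(f"delta_theta = {d_theta:.1f} arcsec, delta_phi = {d_phi:.1f} arcsec")
```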

Note, finally, that the type of forced nutation discussed above, which is driven by an 
external torque, is quite distinct from the free nutation described in Section 8.9. 



12.11 Potential Due to a Uniform Ring 

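Consider a uniform ring of mass $M$ and radius $a$, centered on the origin, and lying in the 
$x$-$y$ plane. Let us consider the gravitational potential $\Phi(r)$ generated by such a ring in the 
$x$-$y$ plane (which corresponds to $\theta = 90^\circ$). It follows, from Section 12.3, that for $r > a$, 

$$ \Phi(r) = -\frac{G\,M}{a}\sum_{n=0,\infty}\,[P_n(0)]^2\left(\frac{a}{r}\right)^{n+1}. \tag{12.154} $$

However, $P_0(0) = 1$, $P_1(0) = 0$, $P_2(0) = -1/2$, $P_3(0) = 0$, and $P_4(0) = 3/8$. Hence, 

$$ \Phi(r) = -\frac{G\,M}{r}\left[1 + \frac{1}{4}\left(\frac{a}{r}\right)^2 + \frac{9}{64}\left(\frac{a}{r}\right)^4 + \cdots\right]. \tag{12.155} $$

Likewise, for $r < a$, 

$$ \Phi(r) = -\frac{G\,M}{a}\sum_{n=0,\infty}\,[P_n(0)]^2\left(\frac{r}{a}\right)^{n}, \tag{12.156} $$

giving 

$$ \Phi(r) = -\frac{G\,M}{a}\left[1 + \frac{1}{4}\left(\frac{r}{a}\right)^2 + \frac{9}{64}\left(\frac{r}{a}\right)^4 + \cdots\right]. \tag{12.157} $$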
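
The truncated expansions can be checked against a direct average of $-G\,M/|{\bf r} - {\bf r}'|$ around the ring. The sketch below does this in units where $G\,M = a = 1$; both function names and the sample count are illustrative assumptions, and the direct sum is a simple midpoint rule rather than anything used in the text.

```python
import math


def ring_potential_series(r, a=1.0, gm=1.0):
    """Three-term series for the in-plane potential of a uniform ring."""
    if r > a:
        x = a / r
        return -(gm / r) * (1.0 + x**2 / 4.0 + 9.0 * x**4 / 64.0)
    x = r / a
    return -(gm / a) * (1.0 + x**2 / 4.0 + 9.0 * x**4 / 64.0)


def ring_potential_direct(r, a=1.0, gm=1.0, samples=20000):
    """Average -GM/distance over points spaced evenly around the ring."""
    total = 0.0
    for k in range(samples):
        angle = 2.0 * math.pi * (k + 0.5) / samples
        distance = math.sqrt(r * r + a * a - 2.0 * a * r * math.cos(angle))
        total += 1.0 / distance
    return -gm * total / samples


for radius in (0.2, 5.0):
    series = ring_potential_series(radius)
    direct = ring_potential_direct(radius)
    print(f"r = {radius}: series {series:.6f}, direct {direct:.6f}")
```

The agreement degrades as $r$ approaches $a$, where the neglected higher-order terms are no longer small.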



12.12 Perihelion Precession of the Planets 

The Solar System consists of eight major planets (Mercury to Neptune) moving around 
the Sun in slightly elliptical orbits which are approximately co-planar with one another. 
According to Chapter 5, if we neglect the relatively weak interplanetary gravitational in- 
teractions then the perihelia of the various planets (i.e., the points on their orbits at which 
they are closest to the Sun) remain fixed in space. However, once these interactions are 
taken into account, it turns out that the planetary perihelia all slowly precess around the 
Sun. We can calculate the approximate rate of perihelion precession of a given planet by 
treating the other planets as uniform concentric rings, centered on the Sun, of mass equal 
to the planetary mass, and radius equal to the mean orbital radius.$^1$ This is equivalent to 
averaging the interplanetary gravitational interactions over the orbits of the other planets. 
It is reasonable to do this, since the precession period in question is very much longer than 
the orbital period of any planet in the Solar System. Thus, by treating the other planets 
as rings, we can calculate the mean gravitational perturbation due to these planets, and, 
thereby, determine the desired precession rate. 

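We can conveniently index the planets in the Solar System such that Mercury is planet 
1, and Neptune planet 8. Let the $M_i$ and the $R_i$, for $i = 1, 8$, be the planetary masses and 
orbital radii, respectively. Furthermore, let $M_0$ be the mass of the Sun. It follows, from 
the previous section, that the gravitational potential generated at the $i$th planet by the Sun 
and the other planets is 

$$ \begin{aligned} \Phi(R_i) ={} & -\frac{G\,M_0}{R_i} - G\sum_{j<i}\frac{M_j}{R_i}\left[1 + \frac{1}{4}\left(\frac{R_j}{R_i}\right)^2 + \frac{9}{64}\left(\frac{R_j}{R_i}\right)^4 + \cdots\right] \\ & - G\sum_{j>i}\frac{M_j}{R_j}\left[1 + \frac{1}{4}\left(\frac{R_i}{R_j}\right)^2 + \frac{9}{64}\left(\frac{R_i}{R_j}\right)^4 + \cdots\right]. \end{aligned} \tag{12.158} $$

Now, the radial force per unit mass acting on the $i$th planet is written $f(R_i) = -d\Phi(R_i)/dr$, 
giving 

$$ \begin{aligned} f(R_i) ={} & -\frac{G\,M_0}{R_i^{\,2}} - \frac{G}{R_i^{\,2}}\sum_{j<i} M_j\left[1 + \frac{3}{4}\left(\frac{R_j}{R_i}\right)^2 + \frac{45}{64}\left(\frac{R_j}{R_i}\right)^4 + \cdots\right] \\ & + \frac{G}{R_i^{\,2}}\sum_{j>i} M_j\left(\frac{R_i}{R_j}\right)^3\left[\frac{1}{2} + \frac{9}{16}\left(\frac{R_i}{R_j}\right)^2 + \cdots\right]. \end{aligned} \tag{12.159} $$

$^1$ Stewart, American Journal of Physics 73, 730 (2005). 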



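Hence, we obtain 

$$ \begin{aligned} R_i\,f'(R_i) ={} & \frac{2\,G\,M_0}{R_i^{\,2}} + \frac{G}{R_i^{\,2}}\sum_{j<i} M_j\left[2 + 3\left(\frac{R_j}{R_i}\right)^2 + \frac{135}{32}\left(\frac{R_j}{R_i}\right)^4 + \cdots\right] \\ & + \frac{G}{R_i^{\,2}}\sum_{j>i} M_j\left(\frac{R_i}{R_j}\right)^3\left[\frac{1}{2} + \frac{27}{16}\left(\frac{R_i}{R_j}\right)^2 + \cdots\right], \end{aligned} \tag{12.160} $$

where $' \equiv d/dr$. It follows that 

$$ \begin{aligned} \left[3 + \frac{R_i\,f'(R_i)}{f(R_i)}\right]^{-1/2} ={} & 1 + \frac{3}{4}\sum_{j<i}\frac{M_j}{M_0}\left(\frac{R_j}{R_i}\right)^2\left[1 + \frac{15}{8}\left(\frac{R_j}{R_i}\right)^2 + \frac{175}{64}\left(\frac{R_j}{R_i}\right)^4 + \cdots\right] \\ & + \frac{3}{4}\sum_{j>i}\frac{M_j}{M_0}\left(\frac{R_i}{R_j}\right)^3\left[1 + \frac{15}{8}\left(\frac{R_i}{R_j}\right)^2 + \frac{175}{64}\left(\frac{R_i}{R_j}\right)^4 + \cdots\right]. \end{aligned} \tag{12.161} $$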



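Thus, according to Equation (5.108), the apsidal angle for the $i$th planet is 

$$ \begin{aligned} \psi_i ={} & \pi + \frac{3\pi}{4}\sum_{j<i}\frac{M_j}{M_0}\left(\frac{R_j}{R_i}\right)^2\left[1 + \frac{15}{8}\left(\frac{R_j}{R_i}\right)^2 + \frac{175}{64}\left(\frac{R_j}{R_i}\right)^4 + \cdots\right] \\ & + \frac{3\pi}{4}\sum_{j>i}\frac{M_j}{M_0}\left(\frac{R_i}{R_j}\right)^3\left[1 + \frac{15}{8}\left(\frac{R_i}{R_j}\right)^2 + \frac{175}{64}\left(\frac{R_i}{R_j}\right)^4 + \cdots\right]. \end{aligned} \tag{12.162} $$

Hence, the perihelion of the $i$th planet advances by 

$$ \begin{aligned} \delta\psi_i ={} & \frac{3\pi}{2}\sum_{j<i}\frac{M_j}{M_0}\left(\frac{R_j}{R_i}\right)^2\left[1 + \frac{15}{8}\left(\frac{R_j}{R_i}\right)^2 + \frac{175}{64}\left(\frac{R_j}{R_i}\right)^4 + \cdots\right] \\ & + \frac{3\pi}{2}\sum_{j>i}\frac{M_j}{M_0}\left(\frac{R_i}{R_j}\right)^3\left[1 + \frac{15}{8}\left(\frac{R_i}{R_j}\right)^2 + \frac{175}{64}\left(\frac{R_i}{R_j}\right)^4 + \cdots\right] \end{aligned} \tag{12.163} $$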



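radians per revolution around the Sun. Now, the time for one revolution is $T_i = 2\pi/\omega_i$, 
where $\omega_i^{\,2} = G\,M_0/R_i^{\,3}$. Thus, since $3\pi/2$ radians is $9.72\times 10^{5}$ arc seconds, the rate of 
perihelion precession, in arc seconds per year, is given by 

$$ \begin{aligned} \delta\dot{\psi}_i = \frac{9.72\times 10^{5}}{T_i({\rm yr})}\Bigg\{ & \sum_{j<i}\frac{M_j}{M_0}\left(\frac{R_j}{R_i}\right)^2\left[1 + \frac{15}{8}\left(\frac{R_j}{R_i}\right)^2 + \frac{175}{64}\left(\frac{R_j}{R_i}\right)^4 + \cdots\right] \\ & + \sum_{j>i}\frac{M_j}{M_0}\left(\frac{R_i}{R_j}\right)^3\left[1 + \frac{15}{8}\left(\frac{R_i}{R_j}\right)^2 + \frac{175}{64}\left(\frac{R_i}{R_j}\right)^4 + \cdots\right]\Bigg\}. \end{aligned} \tag{12.164} $$

Planet      M/M_0                     T(yr)     R(au)
Mercury     $1.66\times 10^{-7}$      0.241     0.387
Venus       $2.45\times 10^{-6}$      0.615     0.723
Earth       $3.04\times 10^{-6}$      1.000     1.00
Mars        $3.23\times 10^{-7}$      1.881     1.52
Jupiter     $9.55\times 10^{-4}$      11.86     5.20
Saturn      $2.86\times 10^{-4}$      29.46     9.54
Uranus      $4.36\times 10^{-5}$      84.01     19.19
Neptune     $5.18\times 10^{-5}$      164.8     30.07

Table 12.1: Data for the major planets in the Solar System, giving the planetary mass relative 
to that of the Sun, the orbital period in years, and the mean orbital radius relative to that of 
the Earth. 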



Planet      $(\delta\dot{\psi})_{\rm obs}$     $(\delta\dot{\psi})_{\rm th}$
Mercury     5.75                               5.50
Venus       2.04                               10.75
Earth       11.45                              11.87
Mars        16.28                              17.60
Jupiter     6.55                               7.42
Saturn      19.50                              18.36
Uranus      3.34                               2.72
Neptune     0.36                               0.65

Table 12.2: The observed perihelion precession rates of the planets compared with the 
theoretical precession rates calculated from Equation (12.164) and Table 12.1. The precession rates 
are in arc seconds per year. 

Figure 12.8: The triangular points show the observed perihelion precession rates of the major 
planets in the Solar System, whereas the square points show the theoretical rates calculated 
from Equation (12.164) and Table 12.1. The precession rates are in arc seconds per year. 



Table 12.2 and Figure 12.8 compare the observed perihelion precession rates with the 
theoretical rates calculated from Equation (12.164) and the planetary data given in Ta- 
ble 12.1. It can be seen that there is excellent agreement between the two, except for 
the planet Venus. The main reason for this is that Venus has an unusually low eccentricity 
(e = 0.0068), which renders its perihelion point extremely sensitive to small perturbations. 
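
The ring-averaged estimate is straightforward to evaluate from the data in Table 12.1. The sketch below is illustrative only: it truncates each series after the $(175/64)$ term, so it will not reproduce the tabulated theoretical rates exactly, particularly for planets with close neighbours. The prefactor is an assumption drawn from the derivation, namely $3\pi/2$ radians per revolution converted to arc seconds, and the names `PLANETS`, `bracket`, and `precession_rate` are hypothetical.

```python
import math

# Table 12.1: (name, M/M_0, T in years, R in au)
PLANETS = [
    ("Mercury", 1.66e-7, 0.241, 0.387),
    ("Venus", 2.45e-6, 0.615, 0.723),
    ("Earth", 3.04e-6, 1.000, 1.00),
    ("Mars", 3.23e-7, 1.881, 1.52),
    ("Jupiter", 9.55e-4, 11.86, 5.20),
    ("Saturn", 2.86e-4, 29.46, 9.54),
    ("Uranus", 4.36e-5, 84.01, 19.19),
    ("Neptune", 5.18e-5, 164.8, 30.07),
]

# (3*pi/2) radians per revolution, expressed in arc seconds.
ARCSEC_FACTOR = 1.5 * math.pi * math.degrees(1.0) * 3600.0


def bracket(x):
    """Truncated series 1 + (15/8) x^2 + (175/64) x^4."""
    return 1.0 + 15.0 / 8.0 * x**2 + 175.0 / 64.0 * x**4


def precession_rate(i):
    """Perihelion precession rate of planet i in arc seconds per year."""
    _, _, period, radius = PLANETS[i]
    total = 0.0
    for j, (_, mass, _, other_radius) in enumerate(PLANETS):
        if j < i:                      # interior ring
            x = other_radius / radius
            total += mass * x**2 * bracket(x)
        elif j > i:                    # exterior ring
            x = radius / other_radius
            total += mass * x**3 * bracket(x)
    return ARCSEC_FACTOR * total / period


rates = {PLANETS[i][0]: precession_rate(i) for i in range(len(PLANETS))}
for name, rate in rates.items():
    print(f"{name:8s} {rate:6.2f} arcsec/yr")
```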



12.13 Perihelion Precession of Mercury 

If the calculation described in the previous section is carried out more accurately, taking 
into account the slight eccentricities of the planetary orbits, as well as their small mutual 
inclinations, and retaining many more terms in the expansions (12.155) and (12.157), 
then the perihelion precession rate of the planet Mercury is found to be 5.32 arc seconds 
per year. However, the observed precession rate is 5.75 arc seconds per year. It turns out 
that the cause of this discrepancy is the general relativistic correction to Newtonian gravity. 

General relativity gives rise to a small correction to the force per unit mass exerted by 
the Sun (mass $M_0$) on a planet in a circular orbit of radius $r$, and angular momentum per 
unit mass $h$. In fact, the modified formula for $f$ is 

$$ f \simeq -\frac{G\,M_0}{r^2} - \frac{3\,G\,M_0\,h^2}{c^2\,r^4}, \tag{12.165} $$

where $c$ is the velocity of light in vacuum. It follows that 

$$ \frac{r\,f'}{f} \simeq -2 - \frac{6\,h^2}{c^2\,r^2}. \tag{12.166} $$

Hence, from Equation (5.108), the apsidal angle is 

$$ \psi \simeq \pi\left(1 + \frac{3\,h^2}{c^2\,r^2}\right). \tag{12.167} $$

Thus, the perihelion advances by 

$$ \delta\psi \simeq \frac{6\pi\,G\,M_0}{c^2\,r} \tag{12.168} $$

radians per revolution due to the general relativistic correction to Newtonian gravity. Here, 
use has been made of $h^2 = G\,M_0\,r$. It follows that the rate of perihelion precession due to 
the general relativistic correction is 

$$ \delta\dot{\psi} \simeq \frac{0.0383}{R\,T} \tag{12.169} $$

arc seconds per year, where $R$ is the mean orbital radius in mean Earth orbital radii, and $T$ 
is the orbital period in years. Hence, from Table 12.1, the general relativistic contribution 
to $\delta\dot{\psi}$ for Mercury is 0.41 arc seconds per year. It is easily demonstrated that the 
corresponding contribution is negligible for the other planets in the Solar System. If the above 
calculation is carried out slightly more accurately, taking the eccentricity of Mercury's orbit 
into account, then the general relativistic contribution to $\delta\dot{\psi}$ becomes 0.43 arc seconds per 
year. It follows that the total perihelion precession rate for Mercury is $5.32 + 0.43 = 5.75$ 
arc seconds per year. This is in exact agreement with the observed precession rate. Indeed, 
the ability of general relativity to explain the discrepancy between the observed perihelion 
precession rate of Mercury, and that calculated from Newtonian dynamics, was one of the 
first major successes of this theory. 
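
The circular-orbit relativistic rate can be recovered from $6\pi\,G\,M_0/(c^2\,r)$ radians per revolution. The sketch below assumes standard rounded values for the solar gravitational parameter, the speed of light, and the astronomical unit; these constants are not given in the text, and the function name is a hypothetical helper.

```python
import math

GM_SUN = 1.327e20    # m^3 s^-2, assumed standard value
C_LIGHT = 2.998e8    # m s^-1, assumed standard value
AU = 1.496e11        # m, assumed standard value
ARCSEC_PER_RADIAN = math.degrees(1.0) * 3600.0


def gr_precession_rate(radius_au, period_yr):
    """Relativistic perihelion advance in arc seconds per year (circular orbit)."""
    per_revolution = 6.0 * math.pi * GM_SUN / (C_LIGHT**2 * radius_au * AU)
    return per_revolution * ARCSEC_PER_RADIAN / period_yr


coefficient = gr_precession_rate(1.0, 1.0)   # numerator of the R*T formula
mercury = gr_precession_rate(0.387, 0.241)
earth = gr_precession_rate(1.00, 1.000)
print(f"coefficient = {coefficient:.4f}, Mercury = {mercury:.2f}, Earth = {earth:.3f}")
```

The rate falls off as $1/(R\,T)$, so it is already an order of magnitude smaller for the Earth than for Mercury.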



12.14 Exercises 

12.1. Show that 

$$ \epsilon = \frac{5 - \alpha}{4}\,\frac{\Omega^2\,a^3}{G\,M}, $$

for a self-gravitating, rotating spheroid of ellipticity $\epsilon \ll 1$, mass $M$, mean radius $a$, and 
angular velocity $\Omega$ whose mass density varies as $r^{-\alpha}$ (where $\alpha < 3$). Demonstrate that the 
above formula matches the observed rotational flattening of the Earth when $\alpha = 1.09$ and of 
Jupiter when $\alpha = 1.79$. 

12.2. The Moon's orbital period about the Earth is approximately 27.3 days, and is in the same 
direction as the Earth's axial rotation (whose period is 24 hours). Use this data to show that 
high tides at a given point on the Earth occur every 12 hours and 26 minutes. 

12.3. Estimate the tidal elongation of the Moon due to the Earth. 

12.4. Consider an artificial satellite in a circular orbit of radius $L$ about the Earth. Suppose that 
the normal to the plane of the orbit subtends an angle $\theta$ with the Earth's axis of rotation. By 
approximating the orbiting satellite as a uniform ring, demonstrate that the Earth's oblateness 
causes the plane of the satellite's orbit to precess about the Earth's rotational axis at the rate 

$$ \Omega \simeq -\frac{1}{2}\,\epsilon\,\frac{R^2}{L^2}\,\omega\,\cos\theta. $$

Here, $\omega$ is the satellite's orbital angular velocity, $\epsilon = 0.00335$ the Earth's ellipticity, and $R$ the 
Earth's radius. Note that the Earth's axial moment of inertia is $I_\parallel \simeq (1/3)\,M\,R^2$, where $M$ is 
the mass of the Earth. 

12.5. A sun-synchronous satellite is one which always passes over a given point on the Earth at the 
same local solar time. This is achieved by fixing the precession rate of the satellite's orbital 
plane such that it matches the rate at which the Sun appears to move against the background 
of the stars. What orbital altitude above the surface of the Earth would such a satellite need 
to have in order to fly over all latitudes between 50° N and 50° S? Is the direction of the 
satellite orbit in the same sense as the Earth's rotation (prograde), or the opposite sense 
(retrograde)? 

13 The Three-Body Problem 



13.1 Introduction 

We saw earlier, in Section 6.2, that an isolated dynamical system consisting of two freely 
moving point masses exerting forces on one another (which is usually referred to as a 
two-body problem) can always be converted into an equivalent one-body problem. In 
particular, this implies that we can exactly solve a dynamical system containing two gravitationally 
interacting point masses, since the equivalent one-body problem is exactly soluble (see 
Chapter 5 and Section 6.3). What about a system containing three gravitationally 
interacting point masses? Despite hundreds of years of research, no exact solution of this famous 
problem, which is generally known as the three-body problem, has ever been found. It is, 
however, possible to make some progress by severely restricting the problem's scope. 

13.2 Circular Restricted Three-Body Problem 

Consider an isolated dynamical system consisting of three gravitationally interacting point 
masses, $m_1$, $m_2$, and $m_3$. Suppose, however, that the third mass, $m_3$, is so much smaller 
than the other two that it has a negligible effect on their motion. Suppose, further, that the 
first two masses, $m_1$ and $m_2$, execute circular orbits about their common center of mass. 
In the following, we shall investigate this simplified problem, which is generally known as 
the circular restricted three-body problem. 

Let us define a Cartesian coordinate system $(\xi, \eta, \zeta)$ in an inertial reference frame 
whose origin coincides with the center of mass, $C$, of the two orbiting masses. 
Furthermore, let the orbital plane of these masses coincide with the $\xi$-$\eta$ plane, and let them both 
lie on the $\xi$-axis at time $t = 0$ (see Figure 13.1). Suppose that $R$ is the constant distance 
between the two orbiting masses, $r_1$ the constant distance between mass $m_1$ and the 
origin, and $r_2$ the constant distance between mass $m_2$ and the origin. Moreover, let $\omega$ be the 
constant orbital angular velocity. It follows, from Section 6.3, that 

$$ \omega^2 = \frac{G\,M}{R^3}, \tag{13.1} $$

$$ \frac{r_1}{r_2} = \frac{m_2}{m_1}, \tag{13.2} $$

where $M = m_1 + m_2$. 

Figure 13.1: The circular restricted three-body problem. 

It is convenient to choose our unit of length such that $R = 1$, and our unit of mass such 
that $G\,M = 1$. It follows, from Equation (13.1), that $\omega = 1$. However, we shall continue to 
retain $\omega$ in our equations, for the sake of clarity. Let $\mu_1 = G\,m_1$, and $\mu_2 = G\,m_2 = 1 - \mu_1$. It 
is easily demonstrated that $r_1 = \mu_2$, and $r_2 = 1 - r_1 = \mu_1$. Hence, the two orbiting masses, 
$m_1$ and $m_2$, have position vectors $\mathbf{r}_1 = (\xi_1, \eta_1, 0)$ and $\mathbf{r}_2 = (\xi_2, \eta_2, 0)$, respectively, where 
(see Figure 13.1) 

$$ \mathbf{r}_1 = \mu_2\,(-\cos\omega t,\ -\sin\omega t,\ 0), \tag{13.3} $$

$$ \mathbf{r}_2 = \mu_1\,(\cos\omega t,\ \sin\omega t,\ 0). \tag{13.4} $$

Let the third mass have position vector $\mathbf{r} = (\xi, \eta, \zeta)$. The Cartesian components of the 
equation of motion of this mass are thus 

$$ \ddot{\xi} = -\mu_1\,\frac{(\xi - \xi_1)}{\rho_1^{\,3}} - \mu_2\,\frac{(\xi - \xi_2)}{\rho_2^{\,3}}, \tag{13.5} $$

$$ \ddot{\eta} = -\mu_1\,\frac{(\eta - \eta_1)}{\rho_1^{\,3}} - \mu_2\,\frac{(\eta - \eta_2)}{\rho_2^{\,3}}, \tag{13.6} $$

$$ \ddot{\zeta} = -\mu_1\,\frac{\zeta}{\rho_1^{\,3}} - \mu_2\,\frac{\zeta}{\rho_2^{\,3}}, \tag{13.7} $$

where 

$$ \rho_1^{\,2} = (\xi - \xi_1)^2 + (\eta - \eta_1)^2 + \zeta^2, \tag{13.8} $$

$$ \rho_2^{\,2} = (\xi - \xi_2)^2 + (\eta - \eta_2)^2 + \zeta^2. \tag{13.9} $$


13.3 Jacobi Integral 

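Consider the function

$$C = 2\left(\frac{\mu_1}{\rho_1} + \frac{\mu_2}{\rho_2}\right) + 2\,\omega\,(\xi\,\dot{\eta} - \eta\,\dot{\xi}) - \dot{\xi}^{\,2} - \dot{\eta}^{\,2} - \dot{\zeta}^{\,2}. \tag{13.10}$$

The time derivative of this function is written

$$\dot{C} = -\frac{2\,\mu_1\,\dot{\rho}_1}{\rho_1^{\,2}} - \frac{2\,\mu_2\,\dot{\rho}_2}{\rho_2^{\,2}} + 2\,\omega\,(\xi\,\ddot{\eta} - \eta\,\ddot{\xi}) - 2\,\dot{\xi}\,\ddot{\xi} - 2\,\dot{\eta}\,\ddot{\eta} - 2\,\dot{\zeta}\,\ddot{\zeta}. \tag{13.11}$$

Moreover, it follows, from Equations (13.3)-(13.4) and (13.8)-(13.9), that

$$\rho_1\,\dot{\rho}_1 = -(\xi_1\,\dot{\xi} + \eta_1\,\dot{\eta}) + \omega\,(\xi\,\eta_1 - \eta\,\xi_1) + \xi\,\dot{\xi} + \eta\,\dot{\eta} + \zeta\,\dot{\zeta}, \tag{13.12}$$

$$\rho_2\,\dot{\rho}_2 = -(\xi_2\,\dot{\xi} + \eta_2\,\dot{\eta}) + \omega\,(\xi\,\eta_2 - \eta\,\xi_2) + \xi\,\dot{\xi} + \eta\,\dot{\eta} + \zeta\,\dot{\zeta}. \tag{13.13}$$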



Combining Equations (13.5)-(13.7) with the above three expressions, we obtain (after
considerable algebra)

$$\dot{C} = 0. \tag{13.14}$$

In other words, the function $C$, which is usually referred to as the Jacobi integral, is a
constant of the motion.

Now, we can rearrange Equation (13.10) to give

$$\mathcal{E} \equiv \frac{1}{2}\,(\dot{\xi}^{\,2} + \dot{\eta}^{\,2} + \dot{\zeta}^{\,2}) - \left(\frac{\mu_1}{\rho_1} + \frac{\mu_2}{\rho_2}\right) = \boldsymbol{\omega}\cdot\mathbf{h} - \frac{C}{2}, \tag{13.15}$$

where $\mathcal{E}$ is the energy (per unit mass) of mass $m_3$, $\mathbf{h} = \mathbf{r}\times\dot{\mathbf{r}}$ its angular momentum
(per unit mass), and $\boldsymbol{\omega} = (0, 0, \omega)$ the orbital angular velocity of the other two masses.
Note, however, that $\mathbf{h}$ is not a constant of the motion. Hence, $\mathcal{E}$ is not a constant of the
motion either. In fact, the Jacobi integral is the only constant of the motion in the circular
restricted three-body problem. Incidentally, the energy of mass $m_3$ is not a conserved
quantity because the other two masses in the system are moving.
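The constancy of $C$ can be checked numerically. The following Python sketch integrates Equations (13.5)-(13.7) with a classical fourth-order Runge-Kutta step and records the largest change in the function $C$ of Equation (13.10). The mass parameter $\mu_2 = 0.01$, the initial condition, the step size, and all function names are illustrative assumptions rather than anything prescribed by the text.

```python
import math

MU2 = 0.01            # illustrative mass parameter (an assumption, not from the text)
MU1 = 1.0 - MU2       # normalization mu1 + mu2 = 1
OMEGA = 1.0           # orbital angular velocity in normalized units


def primaries(t):
    """Inertial-frame positions of m1 and m2, Equations (13.3)-(13.4)."""
    c, s = math.cos(OMEGA * t), math.sin(OMEGA * t)
    return (-MU2 * c, -MU2 * s, 0.0), (MU1 * c, MU1 * s, 0.0)


def separations(t, pos):
    """Offsets from m1 and m2 to the third mass, and the distances rho1, rho2."""
    p1, p2 = primaries(t)
    d1 = [a - b for a, b in zip(pos, p1)]
    d2 = [a - b for a, b in zip(pos, p2)]
    rho1 = math.sqrt(sum(c * c for c in d1))
    rho2 = math.sqrt(sum(c * c for c in d2))
    return d1, d2, rho1, rho2


def deriv(t, state):
    """Equations (13.5)-(13.7) written as a first-order system."""
    d1, d2, rho1, rho2 = separations(t, state[:3])
    acc = [-MU1 * a / rho1**3 - MU2 * b / rho2**3 for a, b in zip(d1, d2)]
    return list(state[3:]) + acc


def rk4_step(t, state, dt):
    """One classical fourth-order Runge-Kutta step."""
    def shifted(k, h):
        return [s + h * ki for s, ki in zip(state, k)]
    k1 = deriv(t, state)
    k2 = deriv(t + dt / 2, shifted(k1, dt / 2))
    k3 = deriv(t + dt / 2, shifted(k2, dt / 2))
    k4 = deriv(t + dt, shifted(k3, dt))
    return [s + dt / 6 * (a + 2 * b + 2 * c + d)
            for s, a, b, c, d in zip(state, k1, k2, k3, k4)]


def jacobi(t, state):
    """The function C of Equation (13.10)."""
    xi, eta, zeta, vxi, veta, vzeta = state
    _, _, rho1, rho2 = separations(t, state[:3])
    return (2 * (MU1 / rho1 + MU2 / rho2) + 2 * OMEGA * (xi * veta - eta * vxi)
            - vxi**2 - veta**2 - vzeta**2)


def jacobi_drift(state, dt=1e-3, steps=5000):
    """Largest change in C seen while integrating forward from t = 0."""
    t, c0, worst = 0.0, jacobi(0.0, state), 0.0
    for _ in range(steps):
        state = rk4_step(t, state, dt)
        t += dt
        worst = max(worst, abs(jacobi(t, state) - c0))
    return worst


start = [0.5, 0.0, 0.1, 0.0, 1.2, 0.0]   # arbitrary bound initial condition
print(jacobi_drift(start))
```

The printed drift should be limited only by the truncation error of the integrator, which is the behavior expected of a constant of the motion.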

13.4 Tisserand Criterion 

Consider a dynamical system consisting of three gravitationally interacting point masses,
$m_1$, $m_2$, and $m_3$. Let mass $m_1$ represent the Sun, mass $m_2$ the planet Jupiter, and mass
$m_3$ a comet. Since the mass of a comet is very much less than that of the Sun or Jupiter,
and the Sun and Jupiter are in (almost) circular orbits about their common center of mass,
the dynamical system in question satisfies all of the necessary criteria to be considered an
example of a restricted three-body problem.

Now, the mass of the Sun is much greater than that of Jupiter. It follows that the 
gravitational effect of Jupiter on the cometary orbit is negligible unless the comet makes 
a very close approach to Jupiter. Hence, as described in Chapter 5, before and after such 
an approach, the comet executes a standard elliptical orbit about the Sun with fixed or- 
bital parameters: i.e., fixed major radius, eccentricity, and inclination to the ecliptic plane. 
However, in general, the orbital parameters before and after the close approach will not 
be the same as one another. Let us investigate further. 



Now, since $m_1 \gg m_2$, we have $\mu_1 = G\,m_1 \simeq G\,(m_1 + m_2) = 1$, and $\rho_1 \simeq r$. Hence,
according to Equations (5.51) and (5.60), the (approximately) conserved energy (per unit
mass) of the comet before and after its close approach to Jupiter is

$$\mathcal{E} \equiv \frac{1}{2}\,(\dot{\xi}^{\,2} + \dot{\eta}^{\,2} + \dot{\zeta}^{\,2}) - \frac{1}{r} = -\frac{1}{2\,a}. \tag{13.16}$$

Note that the comet's orbital energy is entirely determined by its major radius, $a$. (Incidentally,
we are working in units such that the major radius of Jupiter's orbit is unity.)
Furthermore, the (approximately) conserved angular momentum (per unit mass) of the
comet before and after its approach to Jupiter is written $\mathbf{h}$, where $\mathbf{h}$ is directed normal to
the comet's orbital plane, and, from Equations (5.27) and (5.47),

$$h^2 = a\,(1 - e^2). \tag{13.17}$$

Here, $e$ is the comet's orbital eccentricity. It follows that

$$\boldsymbol{\omega}\cdot\mathbf{h} = \omega\,h\,\cos I = \sqrt{a\,(1 - e^2)}\,\cos I, \tag{13.18}$$

since $\omega = 1$ in our adopted system of units. Here, $I$ is the angle of inclination of the normal
to the comet's orbital plane to that of Jupiter's orbital plane.

Let $a$, $e$, and $I$ be the major radius, eccentricity, and inclination angle of the cometary
orbit before the close encounter with Jupiter, and let $a'$, $e'$, and $I'$ be the corresponding
parameters after the encounter. It follows from Equations (13.15), (13.16), and (13.18),
and the fact that $C$ is conserved during the encounter, whereas $\mathcal{E}$ and $h$ are not, that

$$\frac{1}{2\,a} + \sqrt{a\,(1 - e^2)}\,\cos I = \frac{1}{2\,a'} + \sqrt{a'\,(1 - e'^{\,2})}\,\cos I'. \tag{13.19}$$

This result is known as the Tisserand criterion, and restricts the possible changes in the
orbital parameters of a comet due to a close encounter with Jupiter (or any other massive
planet).

The Tisserand criterion is very useful. For instance, whenever a new comet is discovered,
astronomers immediately calculate its Tisserand parameter,

$$T_J = \frac{1}{a} + 2\,\sqrt{a\,(1 - e^2)}\,\cos I. \tag{13.20}$$

If this parameter has the same value as that of a previously observed comet then it is quite
likely that the new comet is, in fact, the same comet, but that its orbital parameters have
changed since it was last observed, due to a close encounter with Jupiter. Incidentally,
the subscript $J$ in the above formula is to remind us that we are dealing with the Tisserand
parameter for close encounters with Jupiter. (The parameter is, thus, evaluated in a system
of units in which the major radius of Jupiter's orbit is unity.) Obviously, it is also possible
to calculate Tisserand parameters for close encounters with other planets.
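As a small illustration of how Equations (13.19) and (13.20) are used in practice, the following Python sketch evaluates the Tisserand parameter and, given a new major radius and inclination, solves for the eccentricity that leaves the parameter unchanged. The orbital elements below are made-up illustrative numbers, not data for any real comet, and the function names are our own. The inversion assumes a prograde orbit ($\cos I' > 0$).

```python
import math


def tisserand(a, e, inc_deg):
    """Tisserand parameter of Equation (13.20).

    a is measured in units of the planet's orbital major radius,
    e is the eccentricity, and inc_deg the inclination in degrees.
    """
    return 1.0 / a + 2.0 * math.sqrt(a * (1.0 - e * e)) * math.cos(math.radians(inc_deg))


def eccentricity_after(t_param, a_new, inc_new_deg):
    """Eccentricity e' that keeps the Tisserand parameter equal to t_param.

    Follows from solving Equation (13.20) for e; assumes cos(I') > 0.
    """
    root = (t_param - 1.0 / a_new) / (2.0 * math.cos(math.radians(inc_new_deg)))
    e_squared = 1.0 - root * root / a_new
    if root < 0.0 or e_squared < 0.0:
        raise ValueError("no bound orbit with these elements has this Tisserand parameter")
    return math.sqrt(e_squared)


# Hypothetical elements before an encounter, and a hypothetical new major radius.
t_before = tisserand(1.5, 0.4, 10.0)
e_after = eccentricity_after(t_before, 0.8, 10.0)
print(t_before, e_after)
```

Note that a circular orbit lying in the planet's orbital plane with $a = 1$ gives $T_J = 3$, which provides a quick sanity check on any implementation.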

The Tisserand criterion is also applicable to so-called gravity assists, in which a space-craft
gains energy due to a close encounter with a moving planet. Such assists are often
employed in missions to the outer planets to reduce the amount of fuel which the space-craft
must carry in order to reach its destination. In fact, it is clear, from Equations (13.16)
and (13.19), that a space-craft can make use of a close encounter with a moving planet to
increase (or decrease) its orbital major radius $a$, and, hence, to increase (or decrease) its
total orbital energy.

Figure 13.2: The co-rotating frame.

13.5 Co-Rotating Frame 

Let us transform to a non-inertial frame of reference rotating with angular velocity $\omega$ about
an axis normal to the orbital plane of masses $m_1$ and $m_2$, and passing through their center
of mass. It follows that masses $m_1$ and $m_2$ appear stationary in this new reference frame.
Let us define a Cartesian coordinate system $(x, y, z)$ in the rotating frame of reference
which is such that masses $m_1$ and $m_2$ always lie on the $x$-axis, and the $z$-axis is parallel
to the previously defined $\zeta$-axis. It follows that masses $m_1$ and $m_2$ have the fixed position
vectors $\mathbf{r}_1 = \mu_2\,(-1, 0, 0)$ and $\mathbf{r}_2 = \mu_1\,(1, 0, 0)$ in our new coordinate system. Finally, let
the position vector of mass $m_3$ be $\mathbf{r} = (x, y, z)$ (see Figure 13.2).

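According to Chapter 7, the equation of motion of mass $m_3$ in the rotating reference
frame takes the form

$$\ddot{\mathbf{r}} + 2\,\boldsymbol{\omega}\times\dot{\mathbf{r}} = -\mu_1\,\frac{(\mathbf{r} - \mathbf{r}_1)}{\rho_1^{\,3}} - \mu_2\,\frac{(\mathbf{r} - \mathbf{r}_2)}{\rho_2^{\,3}} - \boldsymbol{\omega}\times(\boldsymbol{\omega}\times\mathbf{r}), \tag{13.21}$$

where $\boldsymbol{\omega} = (0, 0, \omega)$, and

$$\rho_1^{\,2} = (x + \mu_2)^2 + y^2 + z^2, \tag{13.22}$$

$$\rho_2^{\,2} = (x - \mu_1)^2 + y^2 + z^2. \tag{13.23}$$

Here, the second term on the left-hand side of Equation (13.21) is the Coriolis acceleration,
whereas the final term on the right-hand side is the centrifugal acceleration. The
components of Equation (13.21) reduce to

$$\ddot{x} - 2\,\omega\,\dot{y} = -\frac{\mu_1\,(x + \mu_2)}{\rho_1^{\,3}} - \frac{\mu_2\,(x - \mu_1)}{\rho_2^{\,3}} + \omega^2\,x, \tag{13.24}$$

$$\ddot{y} + 2\,\omega\,\dot{x} = -\frac{\mu_1\,y}{\rho_1^{\,3}} - \frac{\mu_2\,y}{\rho_2^{\,3}} + \omega^2\,y, \tag{13.25}$$

$$\ddot{z} = -\frac{\mu_1\,z}{\rho_1^{\,3}} - \frac{\mu_2\,z}{\rho_2^{\,3}}, \tag{13.26}$$

which yield

$$\ddot{x} - 2\,\omega\,\dot{y} = -\frac{\partial U}{\partial x}, \tag{13.27}$$

$$\ddot{y} + 2\,\omega\,\dot{x} = -\frac{\partial U}{\partial y}, \tag{13.28}$$

$$\ddot{z} = -\frac{\partial U}{\partial z}, \tag{13.29}$$

where

$$U = -\frac{\mu_1}{\rho_1} - \frac{\mu_2}{\rho_2} - \frac{\omega^2}{2}\,(x^2 + y^2) \tag{13.30}$$

is the sum of the gravitational and centrifugal potentials.

Now, it follows from Equations (13.27)-(13.29) that

$$\ddot{x}\,\dot{x} - 2\,\omega\,\dot{x}\,\dot{y} = -\dot{x}\,\frac{\partial U}{\partial x}, \tag{13.31}$$

$$\ddot{y}\,\dot{y} + 2\,\omega\,\dot{x}\,\dot{y} = -\dot{y}\,\frac{\partial U}{\partial y}, \tag{13.32}$$

$$\ddot{z}\,\dot{z} = -\dot{z}\,\frac{\partial U}{\partial z}. \tag{13.33}$$

Summing the above three equations, we obtain

$$\frac{d}{dt}\left[\frac{1}{2}\,(\dot{x}^{\,2} + \dot{y}^{\,2} + \dot{z}^{\,2}) + U\right] = 0. \tag{13.34}$$

In other words,

$$C = -2\,U - v^2 \tag{13.35}$$

is a constant of the motion, where $v^2 = \dot{x}^{\,2} + \dot{y}^{\,2} + \dot{z}^{\,2}$. In fact, $C$ is the Jacobi integral
introduced in Section 13.3 [it is easily demonstrated that Equations (13.10) and (13.35)
are identical]. Note, finally, that the mass $m_3$ is restricted to regions in which

$$-2\,U \geq C, \tag{13.36}$$

since $v^2$ is a positive definite quantity.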
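The claim that Equations (13.10) and (13.35) are identical can be spot-checked numerically. In the Python sketch below, a state given in the rotating frame is converted to the inertial frame at $t = 0$ (when the two sets of axes coincide, so that the inertial velocity is simply $\dot{\mathbf{r}} + \boldsymbol{\omega}\times\mathbf{r}$), and the two expressions for $C$ are compared. The sample state, the value of $\mu_2$, and the function names are illustrative assumptions.

```python
import math


def distances(x, y, z, mu2):
    """Return mu1 together with rho1 and rho2 of Equations (13.22)-(13.23)."""
    mu1 = 1.0 - mu2
    rho1 = math.sqrt((x + mu2)**2 + y * y + z * z)
    rho2 = math.sqrt((x - mu1)**2 + y * y + z * z)
    return mu1, rho1, rho2


def potential(x, y, z, mu2, omega=1.0):
    """Gravitational plus centrifugal potential U, Equation (13.30)."""
    mu1, rho1, rho2 = distances(x, y, z, mu2)
    return -mu1 / rho1 - mu2 / rho2 - 0.5 * omega**2 * (x * x + y * y)


def jacobi_rotating(pos, vel, mu2, omega=1.0):
    """C = -2U - v^2, Equation (13.35), with vel measured in the rotating frame."""
    return -2.0 * potential(pos[0], pos[1], pos[2], mu2, omega) - sum(v * v for v in vel)


def jacobi_inertial_at_t0(pos, vel, mu2, omega=1.0):
    """Equation (13.10) evaluated at t = 0 for the same physical state."""
    x, y, z = pos
    vx, vy, vz = vel
    # Inertial velocity = rotating-frame velocity + omega x r.
    vxi, veta, vzeta = vx - omega * y, vy + omega * x, vz
    mu1, rho1, rho2 = distances(x, y, z, mu2)
    return (2.0 * (mu1 / rho1 + mu2 / rho2) + 2.0 * omega * (x * veta - y * vxi)
            - vxi**2 - veta**2 - vzeta**2)


def is_allowed(pos, c_value, mu2, omega=1.0):
    """True if the position satisfies the constraint -2U >= C of Equation (13.36)."""
    return -2.0 * potential(pos[0], pos[1], pos[2], mu2, omega) >= c_value


sample_pos, sample_vel, sample_mu2 = (0.3, 0.4, 0.1), (0.2, -0.5, 0.05), 0.1
print(jacobi_rotating(sample_pos, sample_vel, sample_mu2),
      jacobi_inertial_at_t0(sample_pos, sample_vel, sample_mu2))
```

The two printed numbers should agree to rounding error. The helper `is_allowed` merely restates the inequality (13.36).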
13.6 Lagrange Points

Let us search for possible equilibrium points of the mass $m_3$ in the rotating reference frame.
Such points are termed Lagrange points. Thus, in the rotating frame, the mass $m_3$ would
remain at rest if placed at one of the Lagrange points. It is, thus, clear that these points
are fixed in the rotating frame. Conversely, in the inertial reference frame, the Lagrange
points rotate about the center of mass with angular velocity $\omega$, and the mass $m_3$ would
consequently also rotate about the center of mass with angular velocity $\omega$ if placed at one
of these points (with the appropriate velocity). In the following, we shall assume, without
loss of generality, that $m_1 \geq m_2$.

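The Lagrange points satisfy $\dot{\mathbf{r}} = \ddot{\mathbf{r}} = \mathbf{0}$ in the rotating frame. It thus follows, from
Equations (13.27)-(13.29), that the Lagrange points are the solutions of

$$\frac{\partial U}{\partial x} = \frac{\partial U}{\partial y} = \frac{\partial U}{\partial z} = 0. \tag{13.37}$$

Now, it is easily seen that

$$\frac{\partial U}{\partial z} = \left(\frac{\mu_1}{\rho_1^{\,3}} + \frac{\mu_2}{\rho_2^{\,3}}\right) z. \tag{13.38}$$

Since the term in brackets is positive definite, we conclude that the only solution to the
above equation is $z = 0$. Hence, all of the Lagrange points lie in the $x$-$y$ plane.

If $z = 0$ then it is readily demonstrated that

$$\mu_1\,\rho_1^{\,2} + \mu_2\,\rho_2^{\,2} = x^2 + y^2 + \mu_1\,\mu_2, \tag{13.39}$$

where use has been made of the fact that $\mu_1 + \mu_2 = 1$. Hence, Equation (13.30) can also
be written

$$U = -\mu_1\left(\frac{1}{\rho_1} + \frac{\rho_1^{\,2}}{2}\right) - \mu_2\left(\frac{1}{\rho_2} + \frac{\rho_2^{\,2}}{2}\right) + \frac{\mu_1\,\mu_2}{2}. \tag{13.40}$$

The Lagrange points thus satisfy

$$\frac{\partial U}{\partial x} = \frac{\partial U}{\partial \rho_1}\,\frac{\partial \rho_1}{\partial x} + \frac{\partial U}{\partial \rho_2}\,\frac{\partial \rho_2}{\partial x} = 0, \tag{13.41}$$

$$\frac{\partial U}{\partial y} = \frac{\partial U}{\partial \rho_1}\,\frac{\partial \rho_1}{\partial y} + \frac{\partial U}{\partial \rho_2}\,\frac{\partial \rho_2}{\partial y} = 0, \tag{13.42}$$

which reduce to

$$\mu_1\left(\frac{1 - \rho_1^{\,3}}{\rho_1^{\,2}}\right)\left(\frac{x + \mu_2}{\rho_1}\right) + \mu_2\left(\frac{1 - \rho_2^{\,3}}{\rho_2^{\,2}}\right)\left(\frac{x - \mu_1}{\rho_2}\right) = 0, \tag{13.43}$$

$$\mu_1\left(\frac{1 - \rho_1^{\,3}}{\rho_1^{\,2}}\right)\left(\frac{y}{\rho_1}\right) + \mu_2\left(\frac{1 - \rho_2^{\,3}}{\rho_2^{\,2}}\right)\left(\frac{y}{\rho_2}\right) = 0. \tag{13.44}$$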



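Now, one obvious solution of Equation (13.44) is $y = 0$, corresponding to a Lagrange
point which lies on the $x$-axis. It turns out that there are three such points. $L_1$ lies between
masses $m_1$ and $m_2$, $L_2$ lies to the right of mass $m_2$, and $L_3$ lies to the left of mass $m_1$ (see
Figure 13.2). At the $L_1$ point, we have $x = -\mu_2 + \rho_1 = \mu_1 - \rho_2$ and $\rho_1 = 1 - \rho_2$. Hence, from
Equation (13.43),

$$\frac{\mu_2}{3\,\mu_1} = \frac{\rho_2^{\,3}\,(1 - \rho_2 + \rho_2^{\,2}/3)}{(1 + \rho_2 + \rho_2^{\,2})\,(1 - \rho_2)^3}. \tag{13.45}$$

Assuming that $\rho_2 \ll 1$, we can find an approximate solution of Equation (13.45) by
expanding in powers of $\rho_2$:

$$\alpha = \rho_2 + \frac{\rho_2^{\,2}}{3} + \frac{\rho_2^{\,3}}{3} + \frac{53\,\rho_2^{\,4}}{81} + \mathcal{O}(\rho_2^{\,5}). \tag{13.46}$$

This equation can be inverted to give

$$\rho_2 = \alpha - \frac{\alpha^2}{3} - \frac{\alpha^3}{9} - \frac{23\,\alpha^4}{81} + \mathcal{O}(\alpha^5), \tag{13.47}$$

where

$$\alpha = \left(\frac{\mu_2}{3\,\mu_1}\right)^{1/3} \tag{13.48}$$

is assumed to be a small parameter.

At the $L_2$ point, we have $x = -\mu_2 + \rho_1 = \mu_1 + \rho_2$ and $\rho_1 = 1 + \rho_2$. Hence, from
Equation (13.43),

$$\frac{\mu_2}{3\,\mu_1} = \frac{\rho_2^{\,3}\,(1 + \rho_2 + \rho_2^{\,2}/3)}{(1 + \rho_2)^2\,(1 - \rho_2^{\,3})}. \tag{13.49}$$

Again, expanding in powers of $\rho_2$, we obtain

$$\alpha = \rho_2 - \frac{\rho_2^{\,2}}{3} + \frac{\rho_2^{\,3}}{3} + \frac{\rho_2^{\,4}}{81} + \mathcal{O}(\rho_2^{\,5}), \tag{13.50}$$

$$\rho_2 = \alpha + \frac{\alpha^2}{3} - \frac{\alpha^3}{9} - \frac{31\,\alpha^4}{81} + \mathcal{O}(\alpha^5). \tag{13.51}$$

Finally, at the $L_3$ point, we have $x = -\mu_2 - \rho_1 = \mu_1 - \rho_2$ and $\rho_2 = 1 + \rho_1$. Hence, from
Equation (13.43),

$$\frac{\mu_2}{\mu_1} = \frac{(1 - \rho_1^{\,3})\,(1 + \rho_1)^2}{\rho_1^{\,3}\,(\rho_1^{\,2} + 3\,\rho_1 + 3)}. \tag{13.52}$$

Let $\rho_1 = 1 - \beta$. Expanding in powers of $\beta$, we obtain

$$\frac{\mu_2}{\mu_1} = \frac{12}{7}\,\beta + \frac{144}{49}\,\beta^2 + \frac{1567}{343}\,\beta^3 + \mathcal{O}(\beta^4), \tag{13.53}$$

where $\mu_2/\mu_1$ is assumed to be a small parameter.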
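The series above can be compared against a direct numerical solution. The Python sketch below locates the three collinear points by bisecting $\partial U/\partial x = 0$ along the $x$-axis, and then evaluates the truncated series (13.47) and (13.51) for comparison. The bracketing intervals, the tolerance, the sample value $\mu_2 = 0.01$, and the function names are assumptions made for illustration. The bisection relies on the fact that $\partial U/\partial x$ decreases monotonically with $x$ between its singularities on the axis.

```python
import math


def du_dx_on_axis(x, mu2):
    """dU/dx evaluated at y = z = 0, obtained by differentiating Equation (13.30)."""
    mu1 = 1.0 - mu2
    return (mu1 * (x + mu2) / abs(x + mu2)**3
            + mu2 * (x - mu1) / abs(x - mu1)**3 - x)


def bisect_decreasing(f, lo, hi, tol=1e-13, max_iter=200):
    """Root of a decreasing function f with f(lo) > 0 > f(hi)."""
    for _ in range(max_iter):
        mid = 0.5 * (lo + hi)
        if f(mid) > 0.0:
            lo = mid
        else:
            hi = mid
        if hi - lo < tol:
            break
    return 0.5 * (lo + hi)


def collinear_points(mu2, eps=1e-9):
    """x-coordinates of L1, L2, L3 in the co-rotating frame."""
    mu1 = 1.0 - mu2

    def f(x):
        return du_dx_on_axis(x, mu2)

    l1 = bisect_decreasing(f, -mu2 + eps, mu1 - eps)   # between m1 and m2
    l2 = bisect_decreasing(f, mu1 + eps, 2.0)          # to the right of m2
    l3 = bisect_decreasing(f, -2.0, -mu2 - eps)        # to the left of m1
    return l1, l2, l3


def series_l1_l2(mu2):
    """x-coordinates of L1 and L2 from the truncated series (13.47) and (13.51)."""
    mu1 = 1.0 - mu2
    alpha = (mu2 / (3.0 * mu1)) ** (1.0 / 3.0)         # Equation (13.48)
    rho2_l1 = alpha - alpha**2 / 3 - alpha**3 / 9 - 23 * alpha**4 / 81
    rho2_l2 = alpha + alpha**2 / 3 - alpha**3 / 9 - 31 * alpha**4 / 81
    return mu1 - rho2_l1, mu1 + rho2_l2


MU2_SAMPLE = 0.01
print(collinear_points(MU2_SAMPLE))
print(series_l1_l2(MU2_SAMPLE))
```

For small $\mu_2$ the two sets of values should agree closely, with the discrepancy growing as $\alpha$ becomes larger and the neglected $\mathcal{O}(\alpha^5)$ terms become significant.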
Figure 13.3: The masses $m_1$ and $m_2$, and the five Lagrange points, $L_1$ to $L_5$, calculated for
$\mu_2 = 0.1$.



Let us now search for Lagrange points which do not lie on the $x$-axis. One obvious
solution of Equations (13.41) and (13.42) is

$$\frac{\partial U}{\partial \rho_1} = \frac{\partial U}{\partial \rho_2} = 0, \tag{13.55}$$

giving, from Equation (13.40),

$$\rho_1 = \rho_2 = 1, \tag{13.56}$$

or

$$(x + \mu_2)^2 + y^2 = (x - 1 + \mu_2)^2 + y^2 = 1, \tag{13.57}$$

since $\mu_1 = 1 - \mu_2$. The two solutions of the above equation are

$$x = \frac{1}{2} - \mu_2, \tag{13.58}$$

$$y = \pm\frac{\sqrt{3}}{2}, \tag{13.59}$$

and specify the positions of the Lagrange points designated $L_4$ and $L_5$. Note that point $L_4$
and the masses $m_1$ and $m_2$ lie at the apexes of an equilateral triangle. The same is true for
point $L_5$. We have now found all of the possible Lagrange points.
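The triangular solution is easily verified by direct substitution. The short Python sketch below evaluates the in-plane gradient of $U$ at the points specified by Equations (13.58) and (13.59), and also the distances to the two masses. The function names and the sample value of $\mu_2$ are illustrative assumptions.

```python
import math


def grad_u(x, y, mu2):
    """(dU/dx, dU/dy) in the x-y plane, from Equation (13.30) with omega = 1."""
    mu1 = 1.0 - mu2
    rho1 = math.hypot(x + mu2, y)
    rho2 = math.hypot(x - mu1, y)
    ux = mu1 * (x + mu2) / rho1**3 + mu2 * (x - mu1) / rho2**3 - x
    uy = mu1 * y / rho1**3 + mu2 * y / rho2**3 - y
    return ux, uy


def triangular_points(mu2):
    """Coordinates of L4 and L5, Equations (13.58)-(13.59)."""
    x = 0.5 - mu2
    y = math.sqrt(3.0) / 2.0
    return (x, y), (x, -y)


def distances_to_masses(point, mu2):
    """Distances rho1 and rho2 from a point in the x-y plane to m1 and m2."""
    x, y = point
    return math.hypot(x + mu2, y), math.hypot(x - (1.0 - mu2), y)


l4, l5 = triangular_points(0.1)
print(grad_u(l4[0], l4[1], 0.1), distances_to_masses(l4, 0.1))
```

Both components of the gradient vanish (to rounding error), and both distances are unity, which, together with the unit separation of the two masses, confirms the equilateral-triangle geometry.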

Figure 13.3 shows the positions of the two masses, $m_1$ and $m_2$, and the five Lagrange
points, $L_1$ to $L_5$, calculated for the case where $\mu_2 = 0.1$.
13.7 Zero-Velocity Surfaces

Consider the surface

$$V(x, y, z) = C, \tag{13.60}$$

where

$$V = -2\,U = \frac{2\,\mu_1}{\rho_1} + \frac{2\,\mu_2}{\rho_2} + x^2 + y^2. \tag{13.61}$$

Note that $V \geq 0$. It follows, from Equation (13.35), that if the mass $m_3$ has the Jacobi
integral $C$, and lies on the surface specified in Equation (13.60), then it must have zero
velocity. Hence, such a surface is termed a zero-velocity surface. The zero-velocity surfaces
are important because they form the boundary of regions from which the mass $m_3$ is
dynamically excluded: i.e., regions in which $V < C$. Generally speaking, the regions from
which $m_3$ is excluded grow in area as $C$ increases, and vice versa.

Let $C_i$ be the value of $V$ at the $L_i$ Lagrange point, for $i = 1, \ldots, 5$. When $\mu_2 \ll 1$, it is easily
demonstrated that

$$C_1 \simeq 3 + 3^{4/3}\,\mu_2^{\,2/3} - 10\,\mu_2/3, \tag{13.62}$$

$$C_2 \simeq 3 + 3^{4/3}\,\mu_2^{\,2/3} - 14\,\mu_2/3, \tag{13.63}$$

$$C_3 \simeq 3 + \mu_2, \tag{13.64}$$

$$C_4 \simeq 3 - \mu_2, \tag{13.65}$$

$$C_5 \simeq 3 - \mu_2. \tag{13.66}$$

Note that $C_1 > C_2 > C_3 > C_4 = C_5$.

Figures 13.4-13.8 show the intersection of the zero-velocity surface $V = C$ with the
$x$-$y$ plane for various different values of $C$, and illustrate how the region from which $m_3$
is dynamically excluded (which we shall term the excluded region) evolves as the value
of $C$ is varied. Of course, any point not in the excluded region is in the so-called allowed
region. For $C > C_1$, the allowed region consists of two separate oval regions centered on
$m_1$ and $m_2$, respectively, plus an outer region which lies beyond a large circle centered
on the origin. All three allowed regions are separated from one another by an excluded
region (see Figure 13.4). When $C = C_1$, the two inner allowed regions merge at the $L_1$
point (see Figure 13.5). When $C = C_2$, the inner and outer allowed regions merge at the
$L_2$ point, forming a horseshoe-like excluded region (see Figure 13.6). When $C = C_3$, the
excluded region splits in two at the $L_3$ point (see Figure 13.7). For $C_4 < C < C_3$, the two
excluded regions are localized about the $L_4$ and $L_5$ points (see Figure 13.8). Finally, for
$C < C_4$, there is no excluded region.
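The critical values $C_1$ to $C_5$ that control these changes of topology can be evaluated directly. The Python sketch below finds the collinear points by bisection, evaluates $V$ from Equation (13.61) at all five Lagrange points, and compares the results with the small-$\mu_2$ estimates (13.62)-(13.66). The bracketing intervals, the sample value $\mu_2 = 0.01$, and the function names are assumptions chosen for illustration.

```python
import math


def v_surface(x, y, mu2):
    """V = -2U in the x-y plane, Equation (13.61)."""
    mu1 = 1.0 - mu2
    return (2.0 * mu1 / math.hypot(x + mu2, y) + 2.0 * mu2 / math.hypot(x - mu1, y)
            + x * x + y * y)


def axis_equilibrium(mu2, lo, hi, iterations=200):
    """Bisect for the zero of dU/dx on the x-axis between lo and hi."""
    mu1 = 1.0 - mu2

    def du_dx(x):
        return (mu1 * (x + mu2) / abs(x + mu2)**3
                + mu2 * (x - mu1) / abs(x - mu1)**3 - x)

    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        # dU/dx decreases monotonically between singularities, so a positive
        # value means that the root lies to the right of mid.
        if du_dx(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def critical_values(mu2, eps=1e-9):
    """Return [C1, C2, C3, C4, C5], the values of V at L1 to L5."""
    mu1 = 1.0 - mu2
    x1 = axis_equilibrium(mu2, -mu2 + eps, mu1 - eps)
    x2 = axis_equilibrium(mu2, mu1 + eps, 2.0)
    x3 = axis_equilibrium(mu2, -2.0, -mu2 - eps)
    x45, y45 = 0.5 - mu2, math.sqrt(3.0) / 2.0
    return [v_surface(x1, 0.0, mu2), v_surface(x2, 0.0, mu2), v_surface(x3, 0.0, mu2),
            v_surface(x45, y45, mu2), v_surface(x45, -y45, mu2)]


def small_mu_estimates(mu2):
    """The approximations of Equations (13.62)-(13.66)."""
    lead = 3.0 + 3.0 ** (4.0 / 3.0) * mu2 ** (2.0 / 3.0)
    return [lead - 10.0 * mu2 / 3.0, lead - 14.0 * mu2 / 3.0,
            3.0 + mu2, 3.0 - mu2, 3.0 - mu2]


print(critical_values(0.01))
print(small_mu_estimates(0.01))
```

Comparing a given Jacobi integral $C$ against the first list indicates which of the configurations in Figures 13.4-13.8 applies.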

Figure 13.9 shows the zero-velocity surfaces and Lagrange points calculated for the
case $\mu_2 = 0.01$. It can be seen that, at very small values of $\mu_2$, the $L_1$ and $L_2$ Lagrange
points are almost equidistant from mass $m_2$. Furthermore, mass $m_2$, and the $L_3$, $L_4$, and $L_5$
Lagrange points all lie approximately on a unit circle, centered on mass $m_1$. It follows that,
when $\mu_2$ is small, the Lagrange points $L_3$, $L_4$ and $L_5$ all share the orbit of mass $m_2$ about $m_1$
(in the inertial frame) with $L_3$ being directly opposite $m_2$, $L_4$ (by convention) 60° ahead of
$m_2$, and $L_5$ 60° behind.

Figure 13.4: The zero-velocity surface $V = C$, where $C > C_1$, calculated for $\mu_2 = 0.1$. The
mass $m_3$ is excluded from the region lying between the two inner curves and the outer curve.

Figure 13.5: The zero-velocity surface $V = C$, where $C = C_1$, calculated for $\mu_2 = 0.1$. The
mass $m_3$ is excluded from the region lying between the inner and outer curves.

Figure 13.6: The zero-velocity surface $V = C$, where $C = C_2$, calculated for $\mu_2 = 0.1$. The
mass $m_3$ is excluded from the region lying between the inner and outer curves.

Figure 13.7: The zero-velocity surface $V = C$, where $C = C_3$, calculated for $\mu_2 = 0.1$. The
mass $m_3$ is excluded from the regions lying inside the curve.

Figure 13.8: The zero-velocity surface $V = C$, where $C_4 < C < C_3$, calculated for $\mu_2 = 0.1$.
The mass $m_3$ is excluded from the regions lying inside the two curves.

13.8 Stability of Lagrange Points 

We have seen that the five Lagrange points, $L_1$ to $L_5$, are the equilibrium points of mass $m_3$
in the co-rotating frame. Let us now determine whether or not these equilibrium points
are stable to small displacements.

Now, the equations of motion of mass $m_3$ in the co-rotating frame are specified in Equations
(13.27)-(13.29). Note that the motion in the $x$-$y$ plane is complicated by the presence
of the Coriolis acceleration. However, the motion parallel to the $z$-axis simply corresponds
to motion in the potential $U$. Hence, the condition for the stability of the Lagrange points
(which all lie at $z = 0$) to small displacements parallel to the $z$-axis is simply (see Section 3.2)

$$\left(\frac{\partial^2 U}{\partial z^2}\right)_{z=0} = \frac{\mu_1}{\rho_1^{\,3}} + \frac{\mu_2}{\rho_2^{\,3}} > 0. \tag{13.67}$$

This condition is satisfied everywhere in the $x$-$y$ plane. Hence, the Lagrange points are all
stable to small displacements parallel to the $z$-axis. It, thus, remains to investigate their
stability to small displacements lying within the $x$-$y$ plane.

Suppose that a Lagrange point is situated in the $x$-$y$ plane at coordinates $(x_0, y_0, 0)$.
Let us consider small amplitude $x$-$y$ motion in the vicinity of this point by writing

$$x = x_0 + \delta x, \tag{13.68}$$

$$y = y_0 + \delta y, \tag{13.69}$$

$$z = 0, \tag{13.70}$$

where $\delta x$ and $\delta y$ are infinitesimal. Expanding $U(x, y, 0)$ about the Lagrange point as a
Taylor series, and retaining terms up to second-order in small quantities, we obtain

$$U = U_0 + U_x\,\delta x + U_y\,\delta y + \frac{1}{2}\,U_{xx}\,(\delta x)^2 + U_{xy}\,\delta x\,\delta y + \frac{1}{2}\,U_{yy}\,(\delta y)^2, \tag{13.71}$$

where $U_0 = U(x_0, y_0, 0)$, $U_x = \partial U(x_0, y_0, 0)/\partial x$, $U_{xx} = \partial^2 U(x_0, y_0, 0)/\partial x^2$, etc. However, by
definition, $U_x = U_y = 0$ at a Lagrange point, so the expansion simplifies to

$$U = U_0 + \frac{1}{2}\,U_{xx}\,(\delta x)^2 + U_{xy}\,\delta x\,\delta y + \frac{1}{2}\,U_{yy}\,(\delta y)^2. \tag{13.72}$$

Finally, substitution of Equations (13.68)-(13.70), and (13.72) into the equations of $x$-$y$
motion, (13.27) and (13.28), yields

$$\delta\ddot{x} - 2\,\delta\dot{y} = -U_{xx}\,\delta x - U_{xy}\,\delta y, \tag{13.73}$$

$$\delta\ddot{y} + 2\,\delta\dot{x} = -U_{xy}\,\delta x - U_{yy}\,\delta y, \tag{13.74}$$

since $\omega = 1$.

Let us search for a solution of the above pair of equations of the form $\delta x(t) = \delta x_0\,\exp(\gamma\,t)$
and $\delta y(t) = \delta y_0\,\exp(\gamma\,t)$. We obtain

$$\begin{pmatrix} \gamma^2 + U_{xx} & -2\,\gamma + U_{xy} \\ 2\,\gamma + U_{xy} & \gamma^2 + U_{yy} \end{pmatrix} \begin{pmatrix} \delta x_0 \\ \delta y_0 \end{pmatrix} = \begin{pmatrix} 0 \\ 0 \end{pmatrix}. \tag{13.75}$$

This equation only has a non-trivial solution if the determinant of the matrix is zero.
Hence, we get

$$\gamma^4 + (4 + U_{xx} + U_{yy})\,\gamma^2 + (U_{xx}\,U_{yy} - U_{xy}^{\,2}) = 0. \tag{13.76}$$

Figure 13.9: The zero-velocity surfaces and Lagrange points calculated for $\mu_2 = 0.01$.



Now, it is convenient to define

$$A = \frac{\mu_1}{\rho_1^{\,3}} + \frac{\mu_2}{\rho_2^{\,3}}, \tag{13.77}$$

$$B = 3\left[\frac{\mu_1}{\rho_1^{\,5}} + \frac{\mu_2}{\rho_2^{\,5}}\right] y^2, \tag{13.78}$$

$$C = 3\left[\frac{\mu_1\,(x + \mu_2)}{\rho_1^{\,5}} + \frac{\mu_2\,(x - \mu_1)}{\rho_2^{\,5}}\right] y, \tag{13.79}$$

$$D = 3\left[\frac{\mu_1\,(x + \mu_2)^2}{\rho_1^{\,5}} + \frac{\mu_2\,(x - \mu_1)^2}{\rho_2^{\,5}}\right], \tag{13.80}$$

where all terms are evaluated at the point $(x_0, y_0, 0)$. It thus follows that

$$U_{xx} = A - D - 1, \tag{13.81}$$

$$U_{yy} = A - B - 1, \tag{13.82}$$

$$U_{xy} = -C. \tag{13.83}$$

Consider the co-linear Lagrange points, $L_1$, $L_2$, and $L_3$. These all lie on the $x$-axis, and
are thus characterized by $y = 0$, $\rho_1^{\,2} = (x + \mu_2)^2$, and $\rho_2^{\,2} = (x - \mu_1)^2$. It follows, from the
above equations, that $B = C = 0$ and $D = 3\,A$. Hence, $U_{xx} = -1 - 2\,A$, $U_{yy} = A - 1$, and
$U_{xy} = 0$. Equation (13.76) thus yields

$$\Gamma^2 + (2 - A)\,\Gamma + (1 - A)\,(1 + 2\,A) = 0, \tag{13.84}$$

where $\Gamma = \gamma^2$. Now, in order for a Lagrange point to be stable to small displacements, all
four of the roots, $\gamma$, of Equation (13.76) must be purely imaginary. This, in turn, implies
that the two roots of the above equation,

$$\Gamma = \frac{A - 2 \pm \sqrt{A\,(9\,A - 8)}}{2}, \tag{13.85}$$

must both be real and negative. Thus, the stability criterion is

$$\frac{8}{9} \leq A \leq 1. \tag{13.86}$$

Figure 13.10 shows $A$ calculated at the three co-linear Lagrange points as a function of $\mu_2$,
for all allowed values of this parameter (i.e., $0 < \mu_2 \leq 0.5$). It can be seen that $A$ is always
greater than unity for all three points. Hence, we conclude that the co-linear Lagrange
points, $L_1$, $L_2$, and $L_3$, are intrinsically unstable equilibrium points in the co-rotating frame.
Let us now consider the triangular Lagrange points, $L_4$ and $L_5$. These points are characterized
by $\rho_1 = \rho_2 = 1$. It follows that $A = 1$, $B = 9/4$, $C = \pm\sqrt{27/16}\,(1 - 2\,\mu_2)$, and
$D = 3/4$. Hence, $U_{xx} = -3/4$, $U_{yy} = -9/4$, and $U_{xy} = \mp\sqrt{27/16}\,(1 - 2\,\mu_2)$, where the
upper/lower signs correspond to $L_4$ and $L_5$, respectively. Equation (13.76) thus yields

$$\Gamma^2 + \Gamma + \frac{27}{4}\,\mu_2\,(1 - \mu_2) = 0 \tag{13.87}$$

for both points, where $\Gamma = \gamma^2$. As before, the stability criterion is that the two roots
of the above equation must both be real and negative. This is the case provided that
$1 > 27\,\mu_2\,(1 - \mu_2)$, which yields the stability criterion

$$\mu_2 < \frac{1}{2}\left(1 - \sqrt{\frac{23}{27}}\right) = 0.0385. \tag{13.88}$$

In unnormalized units, this criterion becomes

$$\frac{m_2}{m_1 + m_2} < 0.0385. \tag{13.89}$$

Figure 13.10: The solid, short-dashed, and long-dashed curves show $A$ as a function of $\mu_2$ at
the $L_1$, $L_2$, and $L_3$ Lagrange points.

We thus conclude that the $L_4$ and $L_5$ Lagrange points are stable equilibrium points, in
the co-rotating frame, provided that mass $m_2$ is less than about 4% of mass $m_1$. If this
is the case then mass $m_3$ can orbit around these points indefinitely. In the inertial frame,
the mass will share the orbit of mass $m_2$ about mass $m_1$, but will stay approximately
60° ahead of mass $m_2$, if it is orbiting the $L_4$ point, or 60° behind, if it is orbiting the $L_5$
point (see Figure 13.9). This type of behavior has been observed in the Solar System. For
instance, there is a sub-class of asteroids, known as the Trojan asteroids, which are trapped
in the vicinity of the $L_4$ and $L_5$ points of the Sun-Jupiter system (which easily satisfies the
above stability criterion), and consequently share Jupiter's orbit around the Sun, staying
approximately 60° ahead of, and 60° behind, Jupiter, respectively. Furthermore, the $L_4$ and
$L_5$ points of the Sun-Earth system are occupied by clouds of dust.
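The two stability tests derived above both reduce to asking whether a quadratic in $\Gamma$ has two real, negative roots. The Python sketch below encodes that test and applies it to Equations (13.84) and (13.87). The function names and the sample values of $A$ and $\mu_2$ are illustrative assumptions. Note that the test is strict, so the marginal case in which one root is exactly zero is classified as not stable.

```python
import math


def both_roots_real_negative(b, c):
    """True if both roots of G**2 + b*G + c = 0 are real and negative.

    The roots are real when b**2 - 4c >= 0. Their sum is -b and their
    product is c, so both are negative when b > 0 and c > 0.
    """
    return b * b - 4.0 * c >= 0.0 and b > 0.0 and c > 0.0


def collinear_stable(a_value):
    """Stability test for L1, L2, L3 based on Equation (13.84)."""
    return both_roots_real_negative(2.0 - a_value, (1.0 - a_value) * (1.0 + 2.0 * a_value))


def triangular_stable(mu2):
    """Stability test for L4 and L5 based on Equation (13.87)."""
    return both_roots_real_negative(1.0, 6.75 * mu2 * (1.0 - mu2))


# Critical mass parameter of Equation (13.88).
MU2_CRITICAL = 0.5 * (1.0 - math.sqrt(23.0 / 27.0))

print(MU2_CRITICAL)
print(triangular_stable(0.001), triangular_stable(0.05))
print(collinear_stable(0.95), collinear_stable(5.0))
```

Because $A$ exceeds unity at all three co-linear points (see Figure 13.10), `collinear_stable` returns `False` there for every allowed $\mu_2$, whereas `triangular_stable` switches from `True` to `False` as $\mu_2$ passes through the critical value.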
14 Lunar Motion



14.1 Historical Background

The motion of the Planets around the Sun is fairly accurately described by Kepler's laws 
(see Chapter 5) . Likewise, to a first approximation, the motion of the Moon around the 
Earth can also be accounted for via these laws. However, unlike the planetary orbits, the 
deviations of the lunar orbit from a Keplerian ellipse are sufficiently large that they are 
easily apparent to the naked eye. Indeed, the largest of these deviations, which is gen- 
erally known as evection, was discovered in ancient times by the Alexandrian astronomer 
Claudius Ptolemy (90-168 AD). Moreover, the next largest deviation, which is called vari- 
ation, was first observed by Tycho Brahe (1546-1601) without the aid of a telescope. 
Another non-Keplerian feature of the lunar orbit, which is sufficiently obvious that it was
known to the ancient Greeks, is the fact that the lunar perigee (i.e., the point of closest
approach to the Earth) precesses (i.e., orbits about the Earth in the same direction as the
Moon) at such a rate that it completes a full circuit every 8.85 years.$^1$ The ancient Greeks
also noticed that the lunar ascending node (i.e., the point at which the Moon passes through 
the fixed plane of the Earth's orbit around the Sun from south to north) regresses (i.e., or- 
bits about the Earth in the opposite direction to the Moon) at such a rate that it completes 
a full circuit every 18.6 years. Of course, according to standard two-body orbit theory, 
both the lunar perigee and ascending node should be stationary (see Chapter 5). 

Newton demonstrated, in Book III of his Principia, that the deviations of the lunar orbit 
from a Keplerian ellipse are due to the gravitational influence of the Sun, which is suffi- 
ciently large that it is not completely negligible compared with the mutual gravitational 
attraction of the Earth and the Moon. However, Newton was not able to give a complete 
account of these deviations, due to the complexity of the equations of motion which arise 
in a system of three mutually gravitating bodies (see Chapter 13). In fact, Clairaut (1713- 
1765) is generally credited with the first reasonably accurate and complete theoretical 
explanation of the Moon's orbit. His method of calculation makes use of an expansion of 
the lunar equations of motion in terms of various small parameters. Clairaut, however, 
initially experienced difficulty in accounting for the precession of the lunar perigee. In- 
deed, his first calculation overestimated the period of this precession by a factor of about 
two, leading him to question Newton's inverse-square law of gravitation. Later on, he 
realized that he could indeed account for the precession, in terms of standard Newto- 
nian dynamics, by continuing his expansion in small parameters to higher order. After 
Clairaut, the theory of lunar motion was further elaborated in major works by D'Alembert
(1717-1783), Euler (1707-1783), Laplace (1749-1827), Damoiseau (1768-1846), Plana
(1781-1864), Poisson (1781-1840), Hansen (1795-1874), De Pontecoulant (1795-1874),
J. Herschel (1792-1871), Airy (1801-1892), Delaunay (1816-1872), G.W. Hill (1838-1914),
and E.W. Brown (1866-1938). The fact that so many celebrated mathematicians
and astronomers devoted so much time and effort to lunar theory is a tribute to its inherent
difficulty, as well as its great theoretical and practical interest. Indeed, for a period of about
one hundred years (between 1767 and about 1850) the so-called method of lunar distance
was the principal means used by mariners to determine longitude at sea. This method
depends crucially on a precise knowledge of the position of the Moon in the sky as a function
of time. Consequently, astronomers and mathematicians during the period in question
were spurred to make ever more accurate observations of the Moon's orbit, and to develop
lunar theory to greater and greater precision. Important outcomes of these activities
were various tables of lunar motion (e.g., those of Mayer, Damoiseau, Plana, Hansen, and
Brown), most of which were published at public expense. Finally, it is worth noting that
the development of lunar theory gave rise to many significant advances in mathematics
and dynamics, and was, in many ways, a precursor to the discovery of chaotic motion in
the 20th century (see Chapter 15).

$^1$ Note that this precession rate is about $10^4$ times greater than any of the planetary perihelion
precession rates discussed in Section 12.12.

This chapter contains an introduction to lunar theory in which approximate expressions
for evection, variation, the precession of the perigee, and the regression of the ascending
node are derived from the laws of Newtonian dynamics.$^2$



14.2 Preliminary Analysis 

Let $\mathbf{r}_E$ and $\mathbf{r}_M$ be the position vectors of the Earth and Moon, respectively, in a non-rotating
reference frame in which the Sun is at rest at the origin. Treating this reference frame as
inertial (which is an excellent approximation, given that the mass of the Sun is very much
greater than that of the Earth or the Moon; see Chapter 6), the Earth's equation of motion
becomes (see Chapter 5)

$$\ddot{\mathbf{r}}_E = -n'^{\,2}\,a'^{\,3}\,\frac{\mathbf{r}_E}{|\mathbf{r}_E|^3}, \tag{14.1}$$

where $n' = 0.98560025^\circ$ per day and $a' = 149{,}598{,}261$ km are the mean angular velocity
and major radius, respectively, of the terrestrial orbit about the Sun. Here, $\ddot{\ } \equiv d^2/dt^2$. On
the other hand, the Moon's equation of motion takes the form

$$\ddot{\mathbf{r}}_M = -n^2\,a^3\,\frac{(\mathbf{r}_M - \mathbf{r}_E)}{|\mathbf{r}_M - \mathbf{r}_E|^3} - n'^{\,2}\,a'^{\,3}\,\frac{\mathbf{r}_M}{|\mathbf{r}_M|^3}, \tag{14.2}$$

where $n = 13.17639646^\circ$ per day and $a = 384{,}399$ km are the mean angular velocity
and major radius, respectively, of the lunar orbit about the Earth. Note that we have
retained the perturbing influence (i.e., acceleration) of the Sun in the lunar equation of
motion, (14.2), whilst neglecting the perturbing influence of the Moon in the terrestrial
equation of motion, (14.1), since the former influence is significantly greater (by a factor
$M_E/M_M \simeq 81$, where $M_E$ is the mass of the Earth, and $M_M$ the mass of the Moon) than
the latter.

$^2$ For more information on lunar theory, see An Elementary Treatise on the Lunar Theory, H. Godfray
(Macmillan & Co., 1853); An Introductory Treatise on the Lunar Theory, E.W. Brown (Cambridge University
Press, 1896); Lectures on the Lunar Theory, J.C. Adams (Cambridge University Press, 1900).
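Let

$$\mathbf{r} = \mathbf{r}_M - \mathbf{r}_E, \tag{14.3}$$

$$\mathbf{r}' = -\mathbf{r}_E, \tag{14.4}$$

be the position vectors of the Moon and Sun, respectively, relative to the Earth. It follows,
from Equations (14.1)-(14.4), that in a non-inertial reference frame, $S$ (say), in which the
Earth is at rest at the origin, but the coordinate axes point in fixed directions, the lunar and
solar equations of motion take the form

$$\ddot{\mathbf{r}} = -n^2\,a^3\,\frac{\mathbf{r}}{|\mathbf{r}|^3} + n'^{\,2}\,a'^{\,3}\left[\frac{(\mathbf{r}' - \mathbf{r})}{|\mathbf{r}' - \mathbf{r}|^3} - \frac{\mathbf{r}'}{|\mathbf{r}'|^3}\right], \tag{14.5}$$

$$\ddot{\mathbf{r}}' = -n'^{\,2}\,a'^{\,3}\,\frac{\mathbf{r}'}{|\mathbf{r}'|^3}, \tag{14.6}$$

respectively.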

Let us set up a conventional Cartesian coordinate system in $S$ which is such that the
(apparent) orbit of the Sun about the Earth lies in the $x$-$y$ plane. This implies that the $x$-$y$
plane corresponds to the so-called ecliptic plane. Accordingly, in $S$, the Sun appears to orbit
the Earth at the mean angular velocity $\boldsymbol{\omega}' = n'\,\mathbf{e}_z$ (assuming that the $z$-axis points toward
the so-called north ecliptic pole), whereas the projection of the Moon onto the ecliptic
plane orbits the Earth at the mean angular velocity $\boldsymbol{\omega} = n\,\mathbf{e}_z$.

In the following, for the sake of simplicity, we shall neglect the small eccentricity, $e' = 0.016711$,
of the Sun's apparent orbit about the Earth (which is actually the eccentricity
of the Earth's orbit about the Sun), and approximate the solar orbit as a circle, centered
on the Earth. Thus, if $x'$, $y'$, $z'$ are the Cartesian coordinates of the Sun in $S$ then an
appropriate solution of the solar equation of motion, (14.6), is

x' = a'cos(n't), (14.7) 
y' = a'sin(n't), (14.8) 
z' = 0. (14.9) 



14.3 Lunar Equations of Motion 

It is convenient to solve the lunar equation of motion, (14.5), in a geocentric frame of
reference, $S_1$ (say), which rotates with respect to $S$ at the fixed angular velocity $\boldsymbol{\omega}$. Thus,
if the lunar orbit were a circle, centered on the Earth, and lying in the ecliptic plane, then
the Moon would appear stationary in $S_1$. In fact, the small eccentricity of the lunar orbit,
$e = 0.05488$, combined with its slight inclination to the ecliptic plane, $i = 5.161^\circ$, causes
the Moon to execute a small periodic orbit about the stationary point.

Let $x$, $y$, $z$ and $x_1$, $y_1$, $z_1$ be the Cartesian coordinates of the Moon in $S$ and $S_1$, respectively.
It is easily demonstrated that (see Section A.16)

$$x = x_1\,\cos(n\,t) - y_1\,\sin(n\,t), \tag{14.10}$$

$$y = x_1\,\sin(n\,t) + y_1\,\cos(n\,t), \tag{14.11}$$

$$z = z_1. \tag{14.12}$$

Moreover, if $x_1'$, $y_1'$, $z_1'$ are the Cartesian components of the Sun in $S_1$ then (see Section A.5)

$$x_1' = x'\,\cos(n\,t) + y'\,\sin(n\,t), \tag{14.13}$$

$$y_1' = -x'\,\sin(n\,t) + y'\,\cos(n\,t), \tag{14.14}$$

$$z_1' = z', \tag{14.15}$$

giving

$$x_1' = a'\,\cos[(n - n')\,t], \tag{14.16}$$

$$y_1' = -a'\,\sin[(n - n')\,t], \tag{14.17}$$

$$z_1' = 0, \tag{14.18}$$

where use has been made of Equations (14.7)-(14.9).

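Now, in the rotating frame $S_1$, the lunar equation of motion (14.5) transforms to (see
Chapter 7)

$$\ddot{\mathbf{r}} + 2\,\boldsymbol{\omega}\times\dot{\mathbf{r}} + \boldsymbol{\omega}\times(\boldsymbol{\omega}\times\mathbf{r}) = -n^2\,a^3\,\frac{\mathbf{r}}{|\mathbf{r}|^3} + n'^{\,2}\,a'^{\,3}\left[\frac{(\mathbf{r}' - \mathbf{r})}{|\mathbf{r}' - \mathbf{r}|^3} - \frac{\mathbf{r}'}{|\mathbf{r}'|^3}\right], \tag{14.19}$$

where $\dot{\ } \equiv d/dt$. Furthermore, expanding the final term on the right-hand side of (14.19)
to lowest order in the small parameter $a/a' = 0.00257$, we obtain

$$\ddot{\mathbf{r}} + 2\,\boldsymbol{\omega}\times\dot{\mathbf{r}} + \boldsymbol{\omega}\times(\boldsymbol{\omega}\times\mathbf{r}) \simeq -n^2\,a^3\,\frac{\mathbf{r}}{|\mathbf{r}|^3} + \frac{n'^{\,2}\,a'^{\,3}}{|\mathbf{r}'|^3}\left[\frac{(3\,\mathbf{r}\cdot\mathbf{r}')\,\mathbf{r}'}{|\mathbf{r}'|^2} - \mathbf{r}\right]. \tag{14.20}$$

When written in terms of Cartesian coordinates, the above equation yields

$$\ddot{x}_1 - 2\,n\,\dot{y}_1 - (n^2 + n'^{\,2}/2)\,x_1 \simeq -n^2\,a^3\,\frac{x_1}{r^3} + \frac{3}{2}\,n'^{\,2}\,\cos[2\,(n - n')\,t]\,x_1 - \frac{3}{2}\,n'^{\,2}\,\sin[2\,(n - n')\,t]\,y_1, \tag{14.21}$$

$$\ddot{y}_1 + 2\,n\,\dot{x}_1 - (n^2 + n'^{\,2}/2)\,y_1 \simeq -n^2\,a^3\,\frac{y_1}{r^3} - \frac{3}{2}\,n'^{\,2}\,\sin[2\,(n - n')\,t]\,x_1 - \frac{3}{2}\,n'^{\,2}\,\cos[2\,(n - n')\,t]\,y_1, \tag{14.22}$$

$$\ddot{z}_1 + n'^{\,2}\,z_1 \simeq -n^2\,a^3\,\frac{z_1}{r^3}, \tag{14.23}$$

where $r = (x_1^{\,2} + y_1^{\,2} + z_1^{\,2})^{1/2}$, and use has been made of Equations (14.16)-(14.18).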
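The accuracy of the lowest-order expansion that takes Equation (14.19) to Equation (14.20) can be gauged numerically. The Python sketch below compares the exact solar term, $(\mathbf{r}' - \mathbf{r})/|\mathbf{r}' - \mathbf{r}|^3 - \mathbf{r}'/|\mathbf{r}'|^3$, with its expansion, $[3\,(\mathbf{r}\cdot\mathbf{r}')\,\mathbf{r}'/|\mathbf{r}'|^2 - \mathbf{r}]/|\mathbf{r}'|^3$, for a Moon-Sun distance ratio equal to the quoted $a/a'$. The chosen elongation angle and the function names are illustrative assumptions.

```python
import math


def norm(v):
    """Euclidean length of a 3-vector."""
    return math.sqrt(sum(c * c for c in v))


def exact_solar_term(r, r_sun):
    """(r' - r)/|r' - r|^3 - r'/|r'|^3, the bracket in Equation (14.19)."""
    d = [b - a for a, b in zip(r, r_sun)]
    dn, sn = norm(d), norm(r_sun)
    return [di / dn**3 - si / sn**3 for di, si in zip(d, r_sun)]


def tidal_solar_term(r, r_sun):
    """Lowest-order expansion in |r|/|r'|, as used in Equation (14.20)."""
    sn = norm(r_sun)
    dot = sum(a * b for a, b in zip(r, r_sun))
    return [(3.0 * dot * si / sn**2 - ri) / sn**3 for ri, si in zip(r, r_sun)]


def relative_error(r, r_sun):
    """Relative difference between the exact and expanded solar terms."""
    exact = exact_solar_term(r, r_sun)
    approx = tidal_solar_term(r, r_sun)
    return norm([e - a for e, a in zip(exact, approx)]) / norm(exact)


ratio = 0.00257                      # a/a' quoted in the text
angle = math.radians(40.0)           # arbitrary Moon-Sun elongation (assumption)
moon = [ratio * math.cos(angle), ratio * math.sin(angle), 0.0]
sun = [1.0, 0.0, 0.0]                # lengths measured in units of a'
print(relative_error(moon, sun))
```

The relative error is of order $a/a'$ itself, as expected for an expansion truncated at lowest order, and it roughly halves when the Moon's distance is halved.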

It is convenient, at this stage, to normalize all lengths to $a$, and all times to $n^{-1}$. Accordingly,
let

$$X = x_1/a, \tag{14.24}$$

$$Y = y_1/a, \tag{14.25}$$

$$Z = z_1/a, \tag{14.26}$$

and $r/a \equiv R = (X^2 + Y^2 + Z^2)^{1/2}$, and $T = n\,t$. In normalized form, Equations (14.21)-(14.23)
become

$$\ddot{X} - 2\,\dot{Y} - (1 + m^2/2)\,X \simeq -\frac{X}{R^3} + \frac{3}{2}\,m^2\,\cos[2\,(1 - m)\,T]\,X - \frac{3}{2}\,m^2\,\sin[2\,(1 - m)\,T]\,Y, \tag{14.27}$$

$$\ddot{Y} + 2\,\dot{X} - (1 + m^2/2)\,Y \simeq -\frac{Y}{R^3} - \frac{3}{2}\,m^2\,\sin[2\,(1 - m)\,T]\,X - \frac{3}{2}\,m^2\,\cos[2\,(1 - m)\,T]\,Y, \tag{14.28}$$

$$\ddot{Z} + m^2\,Z \simeq -\frac{Z}{R^3}, \tag{14.29}$$

respectively, where $m = n'/n = 0.07480$ is a measure of the perturbing influence of the
Sun on the lunar orbit. Here, $\ddot{\ } \equiv d^2/dT^2$ and $\dot{\ } \equiv d/dT$.
Finally, let us write

$$X = X_0 + \delta X, \tag{14.30}$$
$$Y = \delta Y, \tag{14.31}$$
$$Z = \delta Z, \tag{14.32}$$

where $X_0 = (1 + m^2/2)^{-1/3}$, and $|\delta X|$, $|\delta Y|$, $|\delta Z| \ll X_0$. Thus, if the lunar orbit were a
circle, centered on the Earth, and lying in the ecliptic plane, then, in the rotating frame $S_1$,
the Moon would appear stationary at the point $(X_0, 0, 0)$. Expanding Equations (14.27)-(14.29)
to second-order in $\delta X$, $\delta Y$, $\delta Z$, and neglecting terms of order $m^4$ and $m^2\,\delta X^2$, etc.,
we obtain

$$\delta\ddot{X} - 2\,\delta\dot{Y} - 3\,(1 + m^2/2)\,\delta X \simeq \frac{3}{2}\,m^2 \cos[2 (1 - m)\,T] + \frac{3}{2}\,m^2 \cos[2 (1 - m)\,T]\,\delta X - \frac{3}{2}\,m^2 \sin[2 (1 - m)\,T]\,\delta Y - 3\,\delta X^2 + \frac{3}{2}\,(\delta Y^2 + \delta Z^2), \tag{14.33}$$
$$\delta\ddot{Y} + 2\,\delta\dot{X} \simeq -\frac{3}{2}\,m^2 \sin[2 (1 - m)\,T] - \frac{3}{2}\,m^2 \sin[2 (1 - m)\,T]\,\delta X - \frac{3}{2}\,m^2 \cos[2 (1 - m)\,T]\,\delta Y + 3\,\delta X\,\delta Y, \tag{14.34}$$
$$\delta\ddot{Z} + (1 + 3 m^2/2)\,\delta Z \simeq 3\,\delta X\,\delta Z. \tag{14.35}$$

Now, once the above three equations have been solved for $\delta X$, $\delta Y$, and $\delta Z$, the Cartesian
coordinates, $x$, $y$, $z$, of the Moon in the non-rotating geocentric frame $S$ are obtained from
Equations (14.10)-(14.12), (14.24)-(14.26), and (14.30)-(14.32). However, it is more
convenient to write $x = r \cos\theta$, $y = r \sin\theta$, and $z = r \sin\beta$, where $r$ is the radial distance
between the Earth and Moon, and $\theta$ and $\beta$ are termed the Moon's ecliptic longitude and
ecliptic latitude, respectively. Moreover, it is easily seen that, to second-order in $\delta X$, $\delta Y$, $\delta Z$,
and neglecting terms of order $m^4$,

$$\frac{r}{a} - 1 + \frac{m^2}{6} \simeq \delta X + \frac{1}{2}\,\delta Y^2 + \frac{1}{2}\,\delta Z^2, \tag{14.36}$$
$$\theta - n t \simeq \delta Y - \delta X\,\delta Y, \tag{14.37}$$
$$\beta \simeq \delta Z - \delta X\,\delta Z. \tag{14.38}$$

14.4 Unperturbed Lunar Motion 

Let us, first of all, neglect the perturbing influence of the Sun on the Moon's orbit by setting
$m = 0$ in the lunar equations of motion (14.33)-(14.35). For the sake of simplicity, let us
also neglect nonlinear effects in these equations by setting $\delta X^2 = \delta Y^2 = \delta Z^2 = \delta X\,\delta Y =
\delta X\,\delta Z = 0$. In this case, the equations reduce to

$$\delta\ddot{X} - 2\,\delta\dot{Y} - 3\,\delta X \simeq 0, \tag{14.39}$$
$$\delta\ddot{Y} + 2\,\delta\dot{X} \simeq 0, \tag{14.40}$$
$$\delta\ddot{Z} + \delta Z \simeq 0. \tag{14.41}$$

By inspection, appropriate solutions are

$$\delta X \simeq -e \cos(T - \alpha_0), \tag{14.42}$$
$$\delta Y \simeq 2 e \sin(T - \alpha_0), \tag{14.43}$$
$$\delta Z \simeq i \sin(T - \gamma_0), \tag{14.44}$$

where $e$, $\alpha_0$, $i$, and $\gamma_0$ are arbitrary constants. Recalling that $T = n t$, it follows from
(14.36)-(14.38) that

$$r \simeq a\,[1 - e \cos(n t - \alpha_0)], \tag{14.45}$$
$$\theta \simeq n t + 2 e \sin(n t - \alpha_0), \tag{14.46}$$
$$\beta \simeq i \sin(n t - \gamma_0). \tag{14.47}$$
However, Equations (14.45) and (14.46) are simply first-order (in $e$) approximations to
the familiar Keplerian laws (see Chapter 5)

$$r = \frac{a\,(1 - e^2)}{1 + e \cos(\theta - \alpha_0)}, \tag{14.48}$$
$$r^2\,\dot{\theta} = (1 - e^2)^{1/2}\,n\,a^2, \tag{14.49}$$

where $\dot{~} \equiv d/dt$. Of course, the above two laws describe a body which executes an elliptical
orbit, confocal with the Earth, of major radius $a$, mean angular velocity $n$, and eccentricity
$e$, such that the radius vector connecting the body to the Earth sweeps out equal areas in
equal time intervals. We conclude, unsurprisingly, that the unperturbed lunar orbit is a
Keplerian ellipse. Note that the lunar perigee lies at the fixed ecliptic longitude $\theta = \alpha_0$.
Equation (14.47) is the first-order approximation to

$$\beta = i \sin(\theta - \gamma_0). \tag{14.50}$$

This expression implies that the unperturbed lunar orbit is co-planar, but is inclined at an
angle $i$ to the ecliptic plane. Moreover, the ascending node lies at the fixed ecliptic longitude
$\theta = \gamma_0$. Incidentally, the neglect of nonlinear terms in Equations (14.39)-(14.41) is only
valid as long as $e, i \ll 1$: i.e., provided that the unperturbed lunar orbit is only slightly
elliptical, and slightly inclined to the ecliptic plane. In fact, the observed values of $e$ and $i$
are 0.05488 and 0.09008 radians, respectively, so this is indeed the case.

14.5 Perturbed Lunar Motion

The perturbed nonlinear lunar equations of motion, (14.33)-(14.35), take the general
form

$$\delta\ddot{X} - 2\,\delta\dot{Y} - 3\,(1 + m^2/2)\,\delta X \simeq R_X, \tag{14.51}$$
$$\delta\ddot{Y} + 2\,\delta\dot{X} \simeq R_Y, \tag{14.52}$$
$$\delta\ddot{Z} + (1 + 3 m^2/2)\,\delta Z \simeq R_Z, \tag{14.53}$$

where

$$R_X = a_0 + \sum_{j>0} a_j \cos(\omega_j T - \alpha_j), \tag{14.54}$$
$$R_Y = \sum_{j>0} b_j \sin(\omega_j T - \alpha_j), \tag{14.55}$$
$$R_Z = \sum_{j>0} c_j \sin(\Omega_j T - \gamma_j). \tag{14.56}$$
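    j    $\omega_j$            $\alpha_j$      $\Omega_j$             $\gamma_j$
    1    $1 + c\,m^2$          $\alpha_0$      $1 + g\,m^2$           $\gamma_0$
    2    $2 + 2 c\,m^2$        $2\alpha_0$     $(c - g)\,m^2$         $\alpha_0 - \gamma_0$
    3    $2 + 2 g\,m^2$        $2\gamma_0$     $2 + (c + g)\,m^2$     $\alpha_0 + \gamma_0$
    4    $2 - 2 m$             $0$
    5    $1 - 2 m - c\,m^2$    $-\alpha_0$     $1 - 2 m - g\,m^2$     $-\gamma_0$

Table 14.1: Angular frequencies and phase-shifts associated with the principal periodic driving
terms appearing in the perturbed nonlinear lunar equations of motion.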



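Let us search for solutions of the general form

$$\delta X = x_0 + \sum_{j>0} x_j \cos(\omega_j T - \alpha_j), \tag{14.57}$$
$$\delta Y = \sum_{j>0} y_j \sin(\omega_j T - \alpha_j), \tag{14.58}$$
$$\delta Z = \sum_{j>0} z_j \sin(\Omega_j T - \gamma_j). \tag{14.59}$$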

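Substituting expressions (14.54)-(14.59) into Equations (14.51)-(14.53), it is easily
demonstrated that

$$x_0 = -\frac{a_0}{3\,(1 + m^2/2)}, \tag{14.60}$$
$$x_j = \frac{\omega_j\,a_j - 2\,b_j}{\omega_j\,(1 - 3 m^2/2 - \omega_j^2)}, \tag{14.61}$$
$$y_j = \frac{(\omega_j^2 + 3 + 3 m^2/2)\,b_j - 2\,\omega_j\,a_j}{\omega_j^2\,(1 - 3 m^2/2 - \omega_j^2)}, \tag{14.62}$$
$$z_j = \frac{c_j}{1 + 3 m^2/2 - \Omega_j^2}, \tag{14.63}$$

where $j > 0$.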

The angular frequencies, $\omega_j$, $\Omega_j$, and phase shifts, $\alpha_j$, $\gamma_j$, of the principal periodic
driving terms that appear on the right-hand sides of the perturbed nonlinear lunar equations of
motion, (14.51)-(14.53), are specified in Table 14.1. Here, $c$ and $g$ are, as yet, unspecified
$O(1)$ constants associated with the precession of the lunar perigee, and the regression of
the ascending node, respectively. Note that $\omega_1$ and $\Omega_1$ are the frequencies of the Moon's
unforced motion in ecliptic longitude and latitude, respectively. Moreover, $\omega_4$ is the
forcing frequency associated with the perturbing influence of the Sun. All other frequencies
appearing in Table 14.1 are combinations of these three fundamental frequencies. In fact,
$\omega_2 = 2\,\omega_1$, $\omega_3 = 2\,\Omega_1$, $\omega_5 = \omega_4 - \omega_1$, $\Omega_2 = \omega_1 - \Omega_1$, $\Omega_3 = \omega_1 + \Omega_1$, and $\Omega_5 = \omega_4 - \Omega_1$.
Note that there is no $\Omega_4$.
Table 14.2: Amplitudes of the periodic driving terms appearing in the perturbed nonlinear
lunar equations of motion. 



Now, a comparison of Equations (14.33)-(14.35), (14.51)-(14.53), and Table 14.1
reveals that

$$R_X = \frac{3}{2}\,m^2 \cos(\omega_4 T - \alpha_4) + \frac{3}{2}\,m^2 \cos(\omega_4 T - \alpha_4)\,\delta X - \frac{3}{2}\,m^2 \sin(\omega_4 T - \alpha_4)\,\delta Y - 3\,\delta X^2 + \frac{3}{2}\,(\delta Y^2 + \delta Z^2), \tag{14.64}$$
$$R_Y = -\frac{3}{2}\,m^2 \sin(\omega_4 T - \alpha_4) - \frac{3}{2}\,m^2 \sin(\omega_4 T - \alpha_4)\,\delta X - \frac{3}{2}\,m^2 \cos(\omega_4 T - \alpha_4)\,\delta Y + 3\,\delta X\,\delta Y, \tag{14.65}$$
$$R_Z = 3\,\delta X\,\delta Z. \tag{14.66}$$

Substitution of the solutions (14.57)-(14.59) into the above equations, followed by a
comparison with expressions (14.54)-(14.56), yields the amplitudes $a_j$, $b_j$, and $c_j$ specified in
Table 14.2. Note that, in calculating these amplitudes, we have neglected all contributions
to the periodic driving terms appearing in Equations (14.51)-(14.53) which involve
cubic, or higher order, combinations of $e$, $i$, $m^2$, $x_j$, $y_j$, and $z_j$, since we only expanded
Equations (14.33)-(14.35) to second-order in $\delta X$, $\delta Y$, and $\delta Z$.
For $j = 0$, it follows from Equation (14.60) and Table 14.2 that

$$x_0 = -\frac{1}{2}\,e^2 - \frac{1}{4}\,i^2. \tag{14.67}$$

For $j = 2$, making the approximation $\omega_2 \simeq 2$ (see Table 14.1), it follows from Equations
(14.61), (14.62) and Table 14.2 that

$$x_2 \simeq \frac{1}{2}\,e^2, \tag{14.68}$$
$$y_2 \simeq \frac{1}{4}\,e^2. \tag{14.69}$$

Likewise, making the approximation $\Omega_2 \simeq 0$ (see Table 14.1), it follows from Equation
(14.63) and Table 14.2 that

$$z_2 \simeq \frac{3}{2}\,e\,i. \tag{14.70}$$

For $j = 3$, making the approximation $\omega_3 \simeq 2$ (see Table 14.1), it follows from Equations
(14.61), (14.62) and Table 14.2 that

$$x_3 \simeq \frac{1}{4}\,i^2, \tag{14.71}$$
$$y_3 \simeq -\frac{1}{4}\,i^2. \tag{14.72}$$

Likewise, making the approximation $\Omega_3 \simeq 2$ (see Table 14.1), it follows from Equation
(14.63) and Table 14.2 that

$$z_3 \simeq \frac{1}{2}\,e\,i. \tag{14.73}$$

For $j = 4$, making the approximation $\omega_4 \simeq 2$ (see Table 14.1), it follows from Equations
(14.61), (14.62) and Table 14.2 that

$$x_4 \simeq -m^2, \tag{14.74}$$
$$y_4 \simeq \frac{11}{8}\,m^2. \tag{14.75}$$
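The $j = 4$ amplitudes can be reproduced by evaluating Equations (14.61) and (14.62) directly. The following is a minimal sketch, assuming the leading $j = 4$ driving amplitudes are $a_4 = (3/2)\,m^2$ and $b_4 = -(3/2)\,m^2$, as read off the first terms of (14.64) and (14.65); the function name `response_amplitudes` is ours.

```python
def response_amplitudes(omega, a, b, m):
    """Evaluate x_j and y_j from Equations (14.61) and (14.62)."""
    denom = 1.0 - 1.5 * m**2 - omega**2
    x = (omega * a - 2.0 * b) / (omega * denom)
    y = ((omega**2 + 3.0 + 1.5 * m**2) * b - 2.0 * omega * a) / (omega**2 * denom)
    return x, y

m = 0.07480
# Assumed leading-order driving amplitudes for j = 4 (see lead-in).
a4 = 1.5 * m**2
b4 = -1.5 * m**2
# Use the approximation omega_4 ~ 2 made in the text.
x4, y4 = response_amplitudes(2.0, a4, b4, m)
ratio_x = x4 / m**2   # should be close to -1
ratio_y = y4 / m**2   # should be close to 11/8
```

The small departures of the ratios from $-1$ and $11/8$ come from the $3 m^2/2$ corrections kept in the denominators, which the text neglects at this order.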

Thus, according to Table 14.2,

$$a_5 \simeq -\frac{9}{8}\,m^2 e, \tag{14.76}$$
$$b_5 \simeq \frac{51}{16}\,m^2 e, \tag{14.77}$$
$$c_5 \simeq \frac{3}{2}\,m^2 i. \tag{14.78}$$

For $j = 5$, making the approximation $\omega_5 \simeq 1 - 2 m$ (see Table 14.1), it follows from
Equations (14.61), (14.62), (14.76), and (14.77) that

$$x_5 \simeq -\frac{15}{8}\,m\,e, \tag{14.79}$$
$$y_5 \simeq \frac{15}{4}\,m\,e. \tag{14.80}$$

Likewise, making the approximation $\Omega_5 \simeq 1 - 2 m$ (see Table 14.1), it follows from
Equations (14.63) and (14.78) that

$$z_5 \simeq \frac{3}{8}\,m\,i. \tag{14.81}$$

Thus, according to Table 14.2,

$$a_1 = -\frac{135}{64}\,m^3 e, \tag{14.82}$$
$$b_1 = \frac{765}{128}\,m^3 e, \tag{14.83}$$
$$c_1 = \frac{9}{16}\,m^3 i. \tag{14.84}$$

Finally, for $j = 1$, by analogy with Equations (14.42)-(14.44), we expect

$$x_1 = -e, \tag{14.85}$$
$$y_1 = 2 e, \tag{14.86}$$
$$z_1 = i. \tag{14.87}$$

Thus, since $\omega_1 = 1 + c\,m^2$ (see Table 14.1), it follows from Equations (14.61), (14.82),
(14.83), and (14.85) that

$$-e \simeq \frac{-(225/16)\,m^3 e}{-(3/2)\,m^2 - 2 c\,m^2}, \tag{14.88}$$

which yields

$$c \simeq -\frac{3}{4} - \frac{225}{32}\,m + O(m^2). \tag{14.89}$$

Likewise, since $\Omega_1 = 1 + g\,m^2$ (see Table 14.1), it follows from Equations (14.63), (14.84),
and (14.87) that

$$i \simeq \frac{(9/16)\,m^3 i}{(3/2)\,m^2 - 2 g\,m^2}, \tag{14.90}$$

which yields

$$g \simeq \frac{3}{4} - \frac{9}{32}\,m + O(m^2). \tag{14.91}$$



According to the above analysis, our final expressions for $\delta X$, $\delta Y$, and $\delta Z$ are

$$\delta X = -\frac{1}{2}\,e^2 - \frac{1}{4}\,i^2 - e \cos[(1 + c\,m^2)\,T - \alpha_0] + \frac{1}{2}\,e^2 \cos[2 (1 + c\,m^2)\,T - 2\alpha_0] + \frac{1}{4}\,i^2 \cos[2 (1 + g\,m^2)\,T - 2\gamma_0] - m^2 \cos[2 (1 - m)\,T] - \frac{15}{8}\,m\,e \cos[(1 - 2 m - c\,m^2)\,T + \alpha_0], \tag{14.92}$$
$$\delta Y = 2 e \sin[(1 + c\,m^2)\,T - \alpha_0] + \frac{1}{4}\,e^2 \sin[2 (1 + c\,m^2)\,T - 2\alpha_0] - \frac{1}{4}\,i^2 \sin[2 (1 + g\,m^2)\,T - 2\gamma_0] + \frac{11}{8}\,m^2 \sin[2 (1 - m)\,T] + \frac{15}{4}\,m\,e \sin[(1 - 2 m - c\,m^2)\,T + \alpha_0], \tag{14.93}$$
$$\delta Z = i \sin[(1 + g\,m^2)\,T - \gamma_0] + \frac{3}{2}\,e\,i \sin[(c - g)\,m^2\,T - \alpha_0 + \gamma_0] + \frac{1}{2}\,e\,i \sin[(2 + c\,m^2 + g\,m^2)\,T - \alpha_0 - \gamma_0] + \frac{3}{8}\,m\,i \sin[(1 - 2 m - g\,m^2)\,T + \gamma_0]. \tag{14.94}$$

Thus, making use of Equations (14.36)-(14.38), we find that

$$\frac{r}{a} = 1 - e \cos[(1 + c\,m^2)\,T - \alpha_0] + \frac{1}{2}\,e^2 - \frac{1}{6}\,m^2 - \frac{1}{2}\,e^2 \cos[2 (1 + c\,m^2)\,T - 2\alpha_0] - m^2 \cos[2 (1 - m)\,T] - \frac{15}{8}\,m\,e \cos[(1 - 2 m - c\,m^2)\,T + \alpha_0], \tag{14.95}$$
$$\theta = T + 2 e \sin[(1 + c\,m^2)\,T - \alpha_0] + \frac{5}{4}\,e^2 \sin[2 (1 + c\,m^2)\,T - 2\alpha_0] - \frac{1}{4}\,i^2 \sin[2 (1 + g\,m^2)\,T - 2\gamma_0] + \frac{11}{8}\,m^2 \sin[2 (1 - m)\,T] + \frac{15}{4}\,m\,e \sin[(1 - 2 m - c\,m^2)\,T + \alpha_0], \tag{14.96}$$
$$\beta = i \sin[(1 + g\,m^2)\,T - \gamma_0] + e\,i \sin[(c - g)\,m^2\,T - \alpha_0 + \gamma_0] + e\,i \sin[(2 + c\,m^2 + g\,m^2)\,T - \alpha_0 - \gamma_0] + \frac{3}{8}\,m\,i \sin[(1 - 2 m - g\,m^2)\,T + \gamma_0]. \tag{14.97}$$

The above expressions are accurate up to second-order in the small parameters $e$, $i$, and $m$.



14.6 Description of Lunar Motion 

In order to better understand the perturbed lunar motion derived in the previous section,
it is helpful to introduce the concept of the mean moon. This is an imaginary body which
orbits the Earth, in the ecliptic plane, at a steady angular velocity that is equal to the
Moon's mean orbital angular velocity, $n$. Likewise, the mean sun is a second imaginary
body which orbits the Earth, in the ecliptic plane, at a steady angular velocity that is equal
to the Sun's mean orbital angular velocity, $n'$. Thus, the ecliptic longitudes of the mean
moon and the mean sun are

$$\bar{\theta} = n t, \tag{14.98}$$
$$\bar{\theta}' = n' t, \tag{14.99}$$

respectively. Here, for the sake of simplicity, and also for the sake of consistency with our
previous analysis, we have assumed that both objects are located at ecliptic longitude $0^\circ$
at time $t = 0$.

Now, from Equation (14.95), to first-order in small parameters, the lunar perigee
corresponds to $(1 + c\,m^2)\,n t - \alpha_0 = j\,2\pi$, where $j$ is an integer. However, this condition can
also be written $\bar{\theta} = \alpha$, where

$$\alpha = \alpha_0 + \alpha_1\,n' t, \tag{14.100}$$

and, making use of Equation (14.89), together with the definition $m = n'/n$,

$$\alpha_1 = \frac{3}{4}\,m + \frac{225}{32}\,m^2 + O(m^3). \tag{14.101}$$

Thus, we can identify $\alpha$ as the mean ecliptic longitude of the perigee. Moreover, according
to Equation (14.100), the perigee precesses (i.e., its longitude increases in time) at the
mean rate of $360\,\alpha_1$ degrees per year. (Of course, a year corresponds to $\Delta t = 2\pi/n'$.)
Furthermore, it is clear that this precession is entirely due to the perturbing influence of the
Sun, since it only depends on the parameter $m$, which is a measure of this influence. Given
that $m = 0.07480$, we find that the perigee advances by $34.36^\circ$ per year. Hence, we
predict that the perigee completes a full circuit about the Earth every $1/\alpha_1 = 10.5$ years. In
fact, the lunar perigee completes a full circuit every 8.85 years. Our prediction is somewhat
inaccurate because our previous analysis neglected $O(m^2)$, and smaller, contributions to
the parameter $c$ [see Equation (14.89)], and these turn out to be significant.

From Equation (14.97), to first-order in small parameters, the Moon passes through its
ascending node when $(1 + g\,m^2)\,n t - \gamma_0 = j\,2\pi$, where $j$ is an integer. However, this
condition can also be written $\bar{\theta} = \gamma$, where

$$\gamma = \gamma_0 - \gamma_1\,n' t, \tag{14.102}$$

and, making use of Equation (14.91),

$$\gamma_1 = \frac{3}{4}\,m - \frac{9}{32}\,m^2 + O(m^3). \tag{14.103}$$

Thus, we can identify $\gamma$ as the mean ecliptic longitude of the ascending node. Moreover,
according to Equation (14.102), the ascending node regresses (i.e., its longitude decreases
in time) at the mean rate of $360\,\gamma_1$ degrees per year. As before, it is clear that this
regression is entirely due to the perturbing influence of the Sun. Moreover, we find that the
ascending node retreats by $19.63^\circ$ per year. Hence, we predict that the ascending
node completes a full circuit about the Earth every $1/\gamma_1 = 18.3$ years. In fact, the
lunar ascending node completes a full circuit every 18.6 years, so our prediction is fairly
accurate.
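The quoted rates follow from simple arithmetic on the series for $\alpha_1$ and $\gamma_1$. The short sketch below evaluates them for $m = 0.07480$; it assumes the second-order coefficients $225/32$ and $9/32$ carried over from Equations (14.89) and (14.91), and the variable names are our own.

```python
m = 0.07480  # ratio n'/n of the solar and lunar mean angular velocities

# Truncated series for the perigee and node rates (in units of n').
alpha_1 = 0.75 * m + (225.0 / 32.0) * m**2
gamma_1 = 0.75 * m - (9.0 / 32.0) * m**2

# One year corresponds to n' t = 2 pi, i.e. 360 degrees of solar motion.
perigee_deg_per_year = 360.0 * alpha_1
node_deg_per_year = 360.0 * gamma_1

# Time (in years) for a full circuit of the perigee and of the node.
perigee_period_years = 1.0 / alpha_1
node_period_years = 1.0 / gamma_1
```

Note how large the second-order term is for the perigee: it contributes roughly 40% of $\alpha_1$, which is consistent with the remark that neglected higher-order contributions to $c$ are significant.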

It is helpful to introduce the lunar mean anomaly,

$$M = \bar{\theta} - \alpha, \tag{14.104}$$

which is defined as the angular distance (in longitude) between the mean moon and the
perigee. It is also helpful to introduce the lunar mean argument of latitude,

$$F = \bar{\theta} - \gamma, \tag{14.105}$$

which is defined as the angular distance (in longitude) between the mean moon and the
ascending node. Finally, it is helpful to introduce the mean elongation of the Moon,

$$D = \bar{\theta} - \bar{\theta}', \tag{14.106}$$

which is defined as the difference between the longitudes of the mean moon and the mean
sun.

When expressed in terms of $M$, $F$, and $D$, our previous expression (14.96) for the true
ecliptic longitude of the Moon becomes

$$\theta = \bar{\theta} + \lambda, \tag{14.107}$$

where

$$\lambda = 2 e \sin M + \frac{5}{4}\,e^2 \sin 2M - \frac{1}{4}\,i^2 \sin 2F + \frac{11}{8}\,m^2 \sin 2D + \frac{15}{4}\,m\,e \sin(2D - M) \tag{14.108}$$

is the angular distance (in longitude) between the Moon and the mean moon. The first
three terms on the right-hand side of the above expression are Keplerian (i.e., they are
independent of the perturbing action of the Sun). In fact, the first is due to the eccentricity
of the lunar orbit (i.e., the fact that the geometric center of the orbit is slightly shifted
from the Earth), the second is due to the ellipticity of the orbit (i.e., the fact that the
orbit is slightly non-circular), and the third is due to the slight inclination of the orbit to
the ecliptic plane. However, the final two terms are caused by the perturbing action of
the Sun. In fact, the fourth term corresponds to variation, whilst the fifth corresponds
to evection. Note that variation attains its maximal amplitude around the so-called octant
points, at which the Moon's disk is either one-quarter or three-quarters illuminated (i.e.,
when $D = 45^\circ$, $135^\circ$, $225^\circ$, or $315^\circ$). Conversely, the amplitude of variation is zero around
the so-called quadrant points, at which the Moon's disk is either fully illuminated, half
illuminated, or not illuminated at all (i.e., when $D = 0^\circ$, $90^\circ$, $180^\circ$, or $270^\circ$). Evection can
be thought of as causing a slight reduction in the eccentricity of the lunar orbit around
the times of the new moon and the full moon (i.e., $D = 0^\circ$ and $D = 180^\circ$), and causing
a corresponding slight increase in the eccentricity around the times of the first and last
quarter moons (i.e., $D = 90^\circ$ and $D = 270^\circ$). This follows because the evection term
in Equation (14.108) augments the eccentricity term, $2 e \sin M$, when $\cos 2D = -1$, and
reduces the term when $\cos 2D = +1$. The variation and evection terms appearing in
expression (14.108) oscillate sinusoidally with periods of half a synodic month,³ or 14.8
days, and 31.8 days, respectively. These periods are in good agreement with observations.
Finally, the amplitudes of the variation and evection terms (calculated using $m = 0.07480$
and $e = 0.05488$) are 1630 and 3218 arc seconds, respectively. However, the observed
amplitudes are 2370 and 4586 arc seconds, respectively. It turns out that our expressions for
these amplitudes are somewhat inaccurate because, for the sake of simplicity, we have only
calculated the lowest order (in $m$) contributions to them. Recall that we also neglected
the slight eccentricity, $e' = 0.016711$, of the Sun's apparent orbit about the Earth in our
calculation. In fact, the eccentricity of the solar orbit gives rise to a small additional term
$-3\,m\,e' \sin M'$ on the right-hand side of (14.108), where $M'$ is the Sun's mean anomaly.
This term, which is known as the annual equation, oscillates with a period of a solar year,
and has an amplitude of 772 arc seconds.

³ A synodic month, which is 29.53 days, is the mean period between successive new moons.

When expressed in terms of $D$ and $F$, our previous expression (14.97) for the ecliptic
latitude of the Moon becomes

$$\beta = i \sin(F + \lambda) + \frac{3}{8}\,m\,i \sin(2D - F). \tag{14.109}$$

The first term on the right-hand side of this expression is Keplerian (i.e., it is independent
of the perturbing influence of the Sun). However, the final term, which is known as evection
in latitude, is due to the Sun's action. Evection in latitude can be thought of as causing a
slight increase in the inclination of the lunar orbit to the ecliptic at the times of the first
and last quarter moons, and a slight decrease at the times of the new moon and the full
moon. The evection in latitude term oscillates sinusoidally with a period of 32.3 days, and
has an amplitude of 521 arc seconds. This period is in good agreement with observations.
However, the observed amplitude of the evection in latitude term is 624 arc seconds. As
before, our expression for the amplitude is somewhat inaccurate because we have only
calculated the lowest order (in $m$) contribution.
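The periods quoted in this section, and the amplitude of the evection in latitude term, can be estimated from the frequencies $2 (1 - m)$, $1 - 2 m - c\,m^2$, and $1 - 2 m - g\,m^2$ (all in units of $n$). The sketch below is a rough check only: it assumes a sidereal year of 365.256 days, which is not given in the text, in order to convert $n$ into days, and it uses the truncated series (14.89) and (14.91) for $c$ and $g$.

```python
import math

m, i = 0.07480, 0.09008          # values quoted in the text (i in radians)
SIDEREAL_YEAR_DAYS = 365.256     # assumption: not stated in the text

c = -0.75 - (225.0 / 32.0) * m   # Equation (14.89), truncated
g = 0.75 - (9.0 / 32.0) * m      # Equation (14.91), truncated

# Period 2*pi/n of the mean moon, using m = n'/n.
sidereal_month = m * SIDEREAL_YEAR_DAYS

# A term of normalized frequency f has period sidereal_month / f.
variation_period = sidereal_month / (2.0 * (1.0 - m))
evection_period = sidereal_month / (1.0 - 2.0 * m - c * m**2)
lat_evection_period = sidereal_month / (1.0 - 2.0 * m - g * m**2)

# Amplitude (3/8) m i of the evection in latitude term, in arc seconds.
ARCSEC_PER_RADIAN = 180.0 * 3600.0 / math.pi
lat_evection_amplitude = 0.375 * m * i * ARCSEC_PER_RADIAN
```

Under these assumptions the three periods come out within about a tenth of a day of the quoted 14.8, 31.8, and 32.3 days, and the amplitude within an arc second of the quoted 521.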
15 The Chaotic Pendulum



15.1 Introduction 

Up until now, we have only dealt with dynamical problems which are capable of analytic 
solution. Let us now investigate a problem which is quite intractable analytically, and in 
which meaningful progress can only be made via numerical methods. 



15.2 Basic Problem 

Consider a simple pendulum consisting of a point mass $m$, at the end of a light rigid rod of
length $l$, attached to a fixed frictionless pivot which allows the rod (and the mass) to move
freely under gravity in the vertical plane. Such a pendulum is sketched in Figure 15.1. Let
us parameterize the instantaneous position of the pendulum via the angle $\theta$ the rod makes
with the downward vertical. It is assumed that the pendulum is free to swing through a
full circle. Hence, $\theta$ and $\theta + 2\pi$ both correspond to the same pendulum position.
The angular equation of motion of the pendulum is simply

$$m\,l\,\frac{d^2\theta}{dt^2} + m\,g \sin\theta = 0, \tag{15.1}$$

where $g$ is the downward acceleration due to gravity (see Section 3.10). Suppose that the
pendulum is embedded in a viscous medium (e.g., air). Let us assume that the viscous drag
torque acting on the pendulum is directly proportional to the pendulum's instantaneous
velocity. It follows that, in the presence of viscous drag, the above equation generalizes to

$$m\,l\,\frac{d^2\theta}{dt^2} + \nu\,\frac{d\theta}{dt} + m\,g \sin\theta = 0, \tag{15.2}$$

where $\nu$ is a positive constant parameterizing the viscosity of the medium in question
(see Section 3.5). Of course, viscous damping will eventually drain all energy from the
pendulum, leaving it in a stationary state. In order to maintain the motion against viscosity,
it is necessary to add some external driving. For the sake of simplicity, we choose a fixed
amplitude periodic drive (which could arise, for instance, via periodic oscillations of the
pendulum's pivot point). Thus, the final equation of motion of the pendulum is written

$$m\,l\,\frac{d^2\theta}{dt^2} + \nu\,\frac{d\theta}{dt} + m\,g \sin\theta = A \cos(\omega t), \tag{15.3}$$

where $A$ and $\omega$ are constants parameterizing the amplitude and frequency of the external
driving torque, respectively.
Let

$$\omega_0 = \sqrt{\frac{g}{l}}. \tag{15.4}$$

Of course, we recognize $\omega_0$ as the natural frequency of small amplitude oscillations of the
pendulum. We can conveniently normalize the pendulum's equation of motion by writing

$$\hat{t} = \omega_0\,t, \tag{15.5}$$
$$\hat{\omega} = \frac{\omega}{\omega_0}, \tag{15.6}$$
$$Q = \frac{m\,g}{\omega_0\,\nu}, \tag{15.7}$$
$$\hat{A} = \frac{A}{m\,g}, \tag{15.8}$$

in which case Equation (15.3) becomes

$$\frac{d^2\theta}{d\hat{t}^2} + \frac{1}{Q}\,\frac{d\theta}{d\hat{t}} + \sin\theta = \hat{A} \cos(\hat{\omega}\,\hat{t}). \tag{15.9}$$

Figure 15.1: A simple pendulum.

From now on, the hats on normalized quantities will be omitted, for ease of notation.
Note that, in normalized units, the natural frequency of small amplitude oscillations is
unity. Moreover, $Q$ is the familiar quality-factor, which is, roughly speaking, the number
of oscillations of the undriven system which must elapse before its energy is significantly
reduced via the action of viscosity (see Section 3.6). The quantity $A$ is the amplitude of the
external torque measured in units of the maximum possible gravitational torque. Finally,
$\omega$ is the oscillation frequency of the external torque measured in units of the pendulum's
natural frequency.

Equation (15.9) is clearly a second-order ordinary differential equation. It can, therefore,
also be written as two coupled first-order ordinary differential equations:

$$\frac{d\theta}{dt} = v, \tag{15.10}$$
$$\frac{dv}{dt} = -\frac{v}{Q} - \sin\theta + A \cos(\omega t). \tag{15.11}$$
15.3 Analytic Solution 

Before attempting a numerical solution of the equations of motion of any dynamical system,
it is a good idea to, first, investigate the equations as thoroughly as possible via
standard analytic techniques. Unfortunately, Equations (15.10) and (15.11) constitute a
non-linear dynamical system, because of the presence of the $\sin\theta$ term on the right-hand
side of Equation (15.11). This system, like most non-linear systems, does not possess a
simple analytic solution. Fortunately, however, if we restrict our attention to small
amplitude oscillations, such that the approximation

$$\sin\theta \simeq \theta \tag{15.12}$$

is valid, then the system becomes linear, and can easily be solved analytically.
The linearized equations of motion of the pendulum take the form:

$$\frac{d\theta}{dt} = v, \tag{15.13}$$
$$\frac{dv}{dt} = -\frac{v}{Q} - \theta + A \cos(\omega t). \tag{15.14}$$

Suppose that the pendulum's position, $\theta(0)$, and velocity, $v(0)$, are specified at time $t = 0$.
As is well-known, in this case, the above equations of motion can be solved analytically to
give:

$$\theta(t) = \left\{\theta(0) - \frac{A\,(1 - \omega^2)}{[(1 - \omega^2)^2 + \omega^2/Q^2]}\right\} e^{-t/2Q} \cos(\omega_* t) + \frac{1}{\omega_*}\left\{v(0) + \frac{\theta(0)}{2Q} - \frac{A\,(1 + \omega^2)/2Q}{[(1 - \omega^2)^2 + \omega^2/Q^2]}\right\} e^{-t/2Q} \sin(\omega_* t) + \frac{A\,[(1 - \omega^2) \cos(\omega t) + (\omega/Q) \sin(\omega t)]}{[(1 - \omega^2)^2 + \omega^2/Q^2]}, \tag{15.15}$$

$$v(t) = \left\{v(0) - \frac{A\,\omega^2/Q}{[(1 - \omega^2)^2 + \omega^2/Q^2]}\right\} e^{-t/2Q} \cos(\omega_* t) - \frac{1}{\omega_*}\left\{\theta(0) + \frac{v(0)}{2Q} - \frac{A\,[(1 - \omega^2) + \omega^2/2Q^2]}{[(1 - \omega^2)^2 + \omega^2/Q^2]}\right\} e^{-t/2Q} \sin(\omega_* t) + \frac{\omega A\,[-(1 - \omega^2) \sin(\omega t) + (\omega/Q) \cos(\omega t)]}{[(1 - \omega^2)^2 + \omega^2/Q^2]}. \tag{15.16}$$

Here,

$$\omega_* = \sqrt{1 - \frac{1}{4Q^2}}, \tag{15.17}$$

and it is assumed that $Q > 1/2$. It can be seen that the above expressions for $\theta$ and $v$ both
consist of three terms. The first two terms clearly represent transients, since they depend
on the initial conditions, and decay exponentially in time (see Section 3.9). In fact, the
e-folding time for the decay of these terms is $2Q$ (in normalized time units). The final term
represents the time-asymptotic motion of the pendulum, and is manifestly independent of
the initial conditions (see Section 3.5).

Figure 15.2: A phase-space plot of the periodic attractor for a linear damped periodically
driven pendulum. Data calculated analytically for $Q = 4$ and $\omega = 2$.
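A closed-form solution of this kind is easy to get wrong by a sign, so a numerical self-check is worthwhile. The sketch below is one way to do it, not the author's code: it builds the solution of the linearized equations (15.13)-(15.14) as a decaying transient plus the driven response, fixes the two transient coefficients (called `C` and `S` here, our notation) by imposing $\theta(0)$ and $v(0)$, and leaves the residual of the differential equations to be tested by finite differences.

```python
import math

def linear_pendulum(t, theta0, v0, Q, A, omega):
    """Solution (theta, v) of the linearized driven pendulum at time t."""
    D = (1.0 - omega**2)**2 + (omega / Q)**2
    a = A * (1.0 - omega**2) / D               # cos(omega t) coefficient
    b = A * (omega / Q) / D                    # sin(omega t) coefficient
    w = math.sqrt(1.0 - 1.0 / (4.0 * Q**2))    # omega_*, requires Q > 1/2
    # Transient coefficients fixed by theta(0) = theta0 and v(0) = v0.
    C = theta0 - a
    S = (v0 + C / (2.0 * Q) - b * omega) / w
    decay = math.exp(-t / (2.0 * Q))
    cw, sw = math.cos(w * t), math.sin(w * t)
    theta = (decay * (C * cw + S * sw)
             + a * math.cos(omega * t) + b * math.sin(omega * t))
    v = (decay * ((w * S - C / (2.0 * Q)) * cw - (w * C + S / (2.0 * Q)) * sw)
         + omega * (-a * math.sin(omega * t) + b * math.cos(omega * t)))
    return theta, v

def residuals(t, theta0, v0, Q, A, omega, h=1e-4):
    """Finite-difference residuals of Equations (15.13) and (15.14)."""
    th_p, v_p = linear_pendulum(t + h, theta0, v0, Q, A, omega)
    th_m, v_m = linear_pendulum(t - h, theta0, v0, Q, A, omega)
    th, v = linear_pendulum(t, theta0, v0, Q, A, omega)
    r1 = (th_p - th_m) / (2.0 * h) - v
    r2 = (v_p - v_m) / (2.0 * h) - (-v / Q - th + A * math.cos(omega * t))
    return r1, r2
```

Both residuals should vanish to within the finite-difference error, and evaluating `linear_pendulum` at `t = 0` should return the prescribed initial conditions.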

It is often convenient to visualize the motion of a dynamical system as an orbit, or
trajectory, in phase-space, which is defined as the space of all of the dynamical variables
required to specify the instantaneous state of the system. For the case in hand, there are
two dynamical variables, $v$ and $\theta$, and so phase-space corresponds to the $\theta$-$v$ plane. Note
that each different point in this plane corresponds to a unique instantaneous state of the
pendulum. [Strictly speaking, we should also consider $t$ to be a dynamical variable, since
it appears explicitly on the right-hand side of Equation (15.11).]

It is clear, from Equations (15.15) and (15.16), that if we wait long enough for all of the
transients to decay away then the motion of the pendulum settles down to the following
simple orbit in phase-space:

$$\theta(t) = \frac{A\,[(1 - \omega^2) \cos(\omega t) + (\omega/Q) \sin(\omega t)]}{[(1 - \omega^2)^2 + \omega^2/Q^2]}, \tag{15.18}$$
$$v(t) = \frac{\omega A\,[-(1 - \omega^2) \sin(\omega t) + (\omega/Q) \cos(\omega t)]}{[(1 - \omega^2)^2 + \omega^2/Q^2]}. \tag{15.19}$$

This orbit traces out the closed curve

$$\left(\frac{\theta}{\tilde{A}}\right)^2 + \left(\frac{v}{\omega \tilde{A}}\right)^2 = 1 \tag{15.20}$$

in phase-space, where

$$\tilde{A} = \frac{A}{\sqrt{(1 - \omega^2)^2 + \omega^2/Q^2}}. \tag{15.21}$$

As illustrated in Figure 15.2, this curve is an ellipse whose principal axes are aligned with
the $v$ and $\theta$ coordinate axes. Observe that the curve is closed, which suggests that the
associated motion is periodic in time. In fact, the motion repeats itself exactly every

$$\tau = \frac{2\pi}{\omega} \tag{15.22}$$

normalized time units. The maximum angular displacement of the pendulum from its
undriven rest position ($\theta = 0$) is $\tilde{A}$. As illustrated in Figure 15.3, the variation of $\tilde{A}$
with driving frequency $\omega$ [see Equation (15.21)] displays all of the features of a classic
resonance curve. For $Q \gg 1$, the maximum amplitude of the driven oscillation is proportional
to the quality-factor, $Q$, and is achieved when the driving frequency approximately matches the
natural frequency of the pendulum (i.e., when $|\omega| \simeq 1$). Moreover, the width of the resonance in
$\omega$-space is proportional to $1/Q$ (see Section 3.7).

Figure 15.3: The maximum angular displacement of a linear damped periodically driven
pendulum as a function of driving frequency. The solid curve corresponds to $Q = 1$. The
short-dashed curve corresponds to $Q = 5$. The long-dashed curve corresponds to $Q = 10$.
Analytic data.
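The resonance curve is simple to tabulate. The sketch below evaluates the amplitude of Equation (15.21) and locates its maximum on a frequency grid; the helper names and the grid resolution are arbitrary choices of ours.

```python
import math

def driven_amplitude(A, omega, Q):
    """Time-asymptotic amplitude of the linear driven pendulum, Eq. (15.21)."""
    return A / math.sqrt((1.0 - omega**2)**2 + (omega / Q)**2)

def peak_frequency(Q, n_grid=20001, omega_max=2.0):
    """Driving frequency that maximizes the amplitude, found by a grid scan."""
    step = omega_max / (n_grid - 1)
    best = max(range(1, n_grid),
               key=lambda k: driven_amplitude(1.0, k * step, Q))
    return best * step

amp_on_resonance = driven_amplitude(1.0, 1.0, 10.0)   # equals A * Q at omega = 1
peak_q10 = peak_frequency(10.0)                       # close to, but just below, 1
peak_q1 = peak_frequency(1.0)                         # noticeably below 1
```

For heavy damping the peak moves well below $\omega = 1$, which is why the statement that the maximum occurs at the natural frequency is best read as a large-$Q$ approximation.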

The phase-space curve shown in Figure 15.2 is called a periodic attractor. It is termed
an "attractor" because, irrespective of the initial conditions, the trajectory of the system
in phase-space tends asymptotically to (in other words, is attracted to) this curve as
$t \to \infty$. This gravitation of phase-space trajectories towards the attractor is illustrated
in Figures 15.4 and 15.5. Of course, the attractor is termed "periodic" because it
corresponds to motion which is periodic in time.

Figure 15.4: The phase-space trajectory of a linear damped periodically driven pendulum.
Data calculated analytically for $Q = 1$ and $\omega = 2$. Here, $v(0)/A = 0$ and $\theta(0)/A = 0$.

Figure 15.5: The phase-space trajectory of a linear damped periodically driven pendulum.
Data calculated analytically for $Q = 1$ and $\omega = 2$. Here, $v(0)/A = 0.5$ and $\theta(0)/A = 0.5$.

Let us summarize our findings so far. We have discovered that if a damped pendulum
is subject to a low amplitude periodic drive then its time-asymptotic response (i.e., its
response after any transients have died away) is periodic, with the same period as the driving
torque. Moreover, the response exhibits resonant behaviour as the driving frequency
approaches the natural frequency of oscillation of the pendulum. The amplitude of the
resonant response, as well as the width of the resonant window, is governed by the amount
of damping in the system. After a little reflection, we can easily appreciate that all of these
results are a direct consequence of the linearity of the pendulum's equations of motion
in the low amplitude limit. In fact, it is easily demonstrated that the time-asymptotic
response of any intrinsically stable linear system (with a discrete spectrum of normal modes)
to a periodic drive is periodic, with the same period as the drive. Moreover, if the driving
frequency approaches one of the natural frequencies of oscillation of the system then
the response exhibits resonant behaviour. But, is this the only allowable time-asymptotic
response of a dynamical system to a periodic drive? It turns out that it is not. Indeed,
the response of a non-linear system to a periodic drive is generally much richer and far
more diverse than simple periodic motion. Since the majority of naturally occurring
dynamical systems are non-linear, it is clearly important that we gain a basic understanding
of this phenomenon. Unfortunately, we cannot achieve this goal via a standard analytic
approach, since non-linear equations of motion generally do not possess simple analytic
solutions. Instead, we must employ numerical methods. As an example, let us investigate
the dynamics of a damped pendulum, subject to a periodic drive, with no restrictions on
the amplitude of the pendulum's motion.

15.4 Numerical Solution 

In the following, we present numerical solutions of Equations (15.10) and (15.11) obtained
using a fairly standard fixed step-length fourth-order Runge-Kutta integration scheme.¹
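For concreteness, a fixed step-length fourth-order Runge-Kutta integrator for Equations (15.10) and (15.11) might look like the following. This is a minimal sketch of the classical scheme, not the program actually used to generate the figures; the function names and the choice of passing the parameters $Q$, $A$, and $\omega$ explicitly are our own.

```python
import math

def rhs(t, theta, v, Q, A, omega):
    """Right-hand sides of Equations (15.10) and (15.11)."""
    return v, -v / Q - math.sin(theta) + A * math.cos(omega * t)

def rk4_step(t, theta, v, h, Q, A, omega):
    """Advance (theta, v) by one classical fourth-order Runge-Kutta step."""
    k1t, k1v = rhs(t, theta, v, Q, A, omega)
    k2t, k2v = rhs(t + h / 2, theta + h / 2 * k1t, v + h / 2 * k1v, Q, A, omega)
    k3t, k3v = rhs(t + h / 2, theta + h / 2 * k2t, v + h / 2 * k2v, Q, A, omega)
    k4t, k4v = rhs(t + h, theta + h * k3t, v + h * k3v, Q, A, omega)
    theta_new = theta + h / 6 * (k1t + 2 * k2t + 2 * k3t + k4t)
    v_new = v + h / 6 * (k1v + 2 * k2v + 2 * k3v + k4v)
    return theta_new, v_new

def integrate(theta0, v0, t_end, n_steps, Q, A, omega):
    """Integrate from t = 0 to t = t_end using n_steps equal steps."""
    h = t_end / n_steps
    theta, v = theta0, v0
    for k in range(n_steps):
        # Recompute t from the step index to avoid accumulating round-off.
        theta, v = rk4_step(k * h, theta, v, h, Q, A, omega)
    return theta, v
```

A quick sanity check is the undriven, almost undamped, small-amplitude limit: with `A = 0` and a very large `Q`, a pendulum released from a small angle should return close to its starting state after one natural period, $2\pi$ in normalized units.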

15.5 Poincare Section 

For the sake of definiteness, let us fix the normalized amplitude and frequency of the
external drive to be $A = 1.5$ and $\omega = 2/3$, respectively.² Furthermore, let us investigate any
changes which may develop in the nature of the pendulum's time-asymptotic motion as
the quality-factor $Q$ is varied. Of course, if $Q$ is made sufficiently small (i.e., if the
pendulum is embedded in a sufficiently viscous medium) then we expect the amplitude of the
pendulum's time-asymptotic motion to become low enough that the linear analysis
outlined in Section 15.3 is valid. Indeed, we expect non-linear effects to manifest themselves
as $Q$ is gradually made larger, and the amplitude of the pendulum's motion consequently
increases to such an extent that the small angle approximation breaks down.



¹ W.H. Press, S.A. Teukolsky, W.T. Vetterling, and B.P. Flannery, Numerical recipes in C: The art of scientific
computing, 2nd Edition (Cambridge University Press, Cambridge UK, 1992), Section 16.1.
² G.L. Baker, Control of the chaotic driven pendulum, Am. J. Phys. 63, 832 (1995).
Figure 15.6: Equally spaced (in time) points on a time-asymptotic orbit in phase-space. Data
calculated numerically for $Q = 0.5$, $A = 1.5$, $\omega = 2/3$, $\theta(0) = 0$, and $v(0) = 0$.

Figure 15.6 shows a time-asymptotic orbit in phase-space calculated numerically for
a case where Q is sufficiently small (i.e., Q = 1/2) that the small angle approximation
holds reasonably well. Not surprisingly, the orbit is very similar to the analytic orbits
described in Section 15.3. The fact that the orbit consists of a single loop, and forms a
closed curve in phase-space, strongly suggests that the corresponding motion is periodic
with the same period as the external drive; we term this type of motion period-1 motion.
More generally, period-n motion consists of motion which repeats itself exactly every n
periods of the external drive (and, obviously, does not repeat itself on any time-scale less
than n periods). Of course, period-1 motion is the only allowed time-asymptotic motion
in the small angle limit.

It would certainly be helpful to possess a graphical test for period-n motion. In fact,
such a test was developed more than a hundred years ago by the French mathematician
Henri Poincaré. Nowadays, it is called a Poincaré section in his honour. The idea of a
Poincaré section, as applied to a periodically driven pendulum, is very simple. As before,
we calculate the time-asymptotic motion of the pendulum, and visualize it as a series of
points in θ-v phase-space. However, we only plot one point per period of the external drive.
To be more exact, we only plot a point when

$$\omega t = \phi + 2\pi k \qquad (15.23)$$

where k is any integer, and φ is referred to as the Poincaré phase. For period-1 motion, in
which the motion repeats itself exactly every period of the external drive, we expect the
Poincaré section to consist of only one point in phase-space (i.e., we expect all of the points
to plot on top of one another). Likewise, for period-2 motion, in which the motion repeats
itself exactly every two periods of the external drive, we expect the Poincaré section to
consist of two points in phase-space (i.e., we expect alternating points to plot on top of one
another). Finally, for period-n motion we expect the Poincaré section to consist of n points
in phase-space.

Figure 15.7: The Poincaré section of a time-asymptotic orbit. Data calculated numerically for
Q = 0.5, A = 1.5, ω = 2/3, θ(0) = 0, v(0) = 0, and φ = 0.

Figure 15.7 displays the Poincaré section of the orbit shown in Figure 15.6. The fact
that the section consists of a single point confirms that the motion displayed in Figure 15.6
is indeed period-1 motion.
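
The sampling rule (15.23) can be sketched in a few lines of code. The example below is only a sketch under stated assumptions: it takes the normalized equations of motion to be dθ/dt = v, dv/dt = -v/Q - sin θ + A cos(ωt) (they are not written out in this section), it handles only the Poincaré phase φ = 0, and the names `poincare_section`, `n_transient`, and `steps_per_period` are hypothetical.

```python
import math

def rhs(t, theta, v, Q, A, omega):
    # Assumed normalized equations of motion (not stated in this section).
    return v, -v / Q - math.sin(theta) + A * math.cos(omega * t)

def rk4_step(t, theta, v, h, Q, A, omega):
    # One classical fourth-order Runge-Kutta step of length h.
    k1t, k1v = rhs(t, theta, v, Q, A, omega)
    k2t, k2v = rhs(t + h / 2, theta + h / 2 * k1t, v + h / 2 * k1v, Q, A, omega)
    k3t, k3v = rhs(t + h / 2, theta + h / 2 * k2t, v + h / 2 * k2v, Q, A, omega)
    k4t, k4v = rhs(t + h, theta + h * k3t, v + h * k3v, Q, A, omega)
    return (theta + h / 6 * (k1t + 2 * k2t + 2 * k3t + k4t),
            v + h / 6 * (k1v + 2 * k2v + 2 * k3v + k4v))

def wrap_angle(theta):
    """Map an angle into the interval [-pi, pi)."""
    return (theta + math.pi) % (2 * math.pi) - math.pi

def poincare_section(Q, A, omega, theta0=0.0, v0=0.0,
                     n_transient=100, n_points=20, steps_per_period=200):
    """Return (theta, v) sampled once per drive period, at omega*t = 2*pi*k,
    after discarding n_transient periods so that the motion is time-asymptotic."""
    h = 2 * math.pi / (omega * steps_per_period)
    theta, v = theta0, v0
    points = []
    step = 0
    for period in range(n_transient + n_points):
        if period >= n_transient:
            points.append((wrap_angle(theta), v))  # sample at the start of the period
        for _ in range(steps_per_period):
            theta, v = rk4_step(step * h, theta, v, h, Q, A, omega)
            step += 1
    return points
```

For the parameters of Figure 15.7, all of the returned points should coincide to within numerical precision, which is the numerical counterpart of a section consisting of a single point.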

15.6 Spatial Symmetry Breaking 

Suppose that we now gradually increase the quality-factor Q. What happens to the simple
orbit shown in Figure 15.6? It turns out that, at first, nothing particularly exciting
happens. The size of the orbit gradually increases, indicating a corresponding increase in the
amplitude of the pendulum's motion, but the general nature of the motion remains
unchanged. However, something interesting does occur when Q is increased beyond about
1.2. Figure 15.8 shows the v-coordinate of the orbit's Poincaré section plotted against Q
in the range 1.2 to 1.3. Note the sharp downturn in the curve at Q ≃ 1.245. What does
this signify? Well, Figure 15.9 shows the time-asymptotic phase-space orbit just before the
downturn (i.e., at Q = 1.24), and Figure 15.10 shows the orbit somewhat after the
downturn (i.e., at Q = 1.30). It is clear that the downturn is associated with a sudden change
in the nature of the pendulum's time-asymptotic phase-space orbit. Prior to the downturn,
the orbit spends as much time in the region θ < 0 as in the region θ > 0. However, after
the downturn the orbit spends the majority of its time in the region θ < 0. In other words,
after the downturn, the pendulum bob favours the region to the left of the pendulum's
vertical. This is somewhat surprising, since there is nothing in the pendulum's equations of
motion which differentiates between the regions to the left and to the right of the vertical.
We refer to a solution of this type (i.e., one which fails to realize the full symmetry of the
dynamical system in question) as a symmetry breaking solution. In this case, because the




Figure 15.8: The v-coordinate of the Poincaré section of a time-asymptotic orbit plotted
against the quality-factor Q. Data calculated numerically for A = 1.5, ω = 2/3, θ(0) = 0,
v(0) = 0, and φ = 0.

particular symmetry which is broken is a spatial symmetry, we refer to the process by which 
the symmetry breaking solution suddenly appears, as the control parameter Q is adjusted, 
as spatial symmetry breaking. Needless to say, spatial symmetry breaking is an intrinsically 
non-linear process — it cannot take place in dynamical systems possessing linear equations 
of motion. 

It stands to reason that since the pendulum's equations of motion favour neither the left
nor the right then the left-favouring orbit pictured in Figure 15.10 must be accompanied
by a mirror image right-favouring orbit. How do we obtain this mirror image orbit? It
turns out that all we have to do is keep the physical parameters Q, A, and ω fixed, but
change the initial conditions θ(0) and v(0). Figure 15.11 shows a time-asymptotic
phase-space orbit calculated with the same physical parameters used in Figure 15.10, but with
the initial conditions θ(0) = 0 and v(0) = −3, instead of θ(0) = 0 and v(0) = 0. It can be
seen that the orbit is indeed the mirror image of that pictured in Figure 15.10.
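
The left-right symmetry invoked here can be made explicit. As a sketch, assume that the normalized equations of motion, Equations (15.10) and (15.11), take the standard damped driven pendulum form written on the first line below (this form is an assumption, since those equations are not reproduced in this section). Reflecting the orbit and shifting time by half a drive period then maps solutions onto solutions:

```latex
\begin{align*}
\frac{d\theta}{dt} &= v, &
\frac{dv}{dt} &= -\frac{v}{Q} - \sin\theta + A\cos(\omega t). \\[1ex]
\tilde\theta(t) &\equiv -\theta(t + \pi/\omega), &
\tilde v(t) &\equiv -v(t + \pi/\omega). \\[1ex]
\frac{d\tilde\theta}{dt} &= -v(t + \pi/\omega) = \tilde v, &
\frac{d\tilde v}{dt} &= \frac{v}{Q} + \sin\theta - A\cos(\omega t + \pi)
  = -\frac{\tilde v}{Q} - \sin\tilde\theta + A\cos(\omega t).
\end{align*}
```

In the last line, θ and v on the right-hand side are evaluated at t + π/ω, and use has been made of sin(−θ) = −sin θ and cos(ωt + π) = −cos(ωt). Under this assumption, a left-right symmetric orbit is mapped onto itself, whereas an asymmetric orbit is mapped onto a distinct mirror image orbit, so asymmetric attractors must occur in pairs.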

Figure 15.12 shows the v-coordinate of the Poincaré section of a time-asymptotic orbit,
calculated with the same physical parameters used in Figure 15.8, versus Q in the range
1.2 to 1.3. Data is shown for the two sets of initial conditions discussed above. The
figure is interpreted as follows. When Q is less than a critical value, which is about 1.245,
then the two sets of initial conditions lead to motions which converge on the same
left-right symmetric period-1 attractor. However, once Q exceeds the critical value then the
attractor bifurcates into two asymmetric mirror image period-1 attractors. Obviously, the
bifurcation is indicated by the forking of the curve shown in Figure 15.12. The lower and
upper branches correspond to the left- and right-favouring attractors, respectively.

Spontaneous symmetry breaking, which is the fundamental non-linear process illus- 
trated in the above discussion, plays an important role in many areas of physics. For 
instance, symmetry breaking gives mass to elementary particles in the unified theory of 
Figure 15.9: Equally spaced (in time) points on a time-asymptotic orbit in phase-space. Data
calculated numerically for Q = 1.24, A = 1.5, ω = 2/3, θ(0) = 0, and v(0) = 0.

Figure 15.10: Equally spaced (in time) points on a time-asymptotic orbit in phase-space.
Data calculated numerically for Q = 1.30, A = 1.5, ω = 2/3, θ(0) = 0, and v(0) = 0.

Figure 15.11: Equally spaced (in time) points on a time-asymptotic orbit in phase-space.
Data calculated numerically for Q = 1.30, A = 1.5, ω = 2/3, θ(0) = 0, and v(0) = −3.

Figure 15.12: The v-coordinate of the Poincaré section of a time-asymptotic orbit plotted
against the quality-factor Q. Data calculated numerically for A = 1.5, and ω = 2/3. Data is
shown for two sets of initial conditions: θ(0) = 0 and v(0) = 0 (lower branch); and θ(0) = 0
and v(0) = −3 (upper branch).
electromagnetic and weak interactions. 3 Symmetry breaking also plays a pivotal role in
the so-called "inflation" theory of the expansion of the early universe. 4

15.7 Basins of Attraction 

We have seen that when Q = 1.3, A = 1.5, and ω = 2/3 there are two co-existing
period-1 attractors in θ-v phase-space. The time-asymptotic trajectory of the pendulum
through phase-space converges on one or other of these attractors depending on the initial
conditions: i.e., depending on the values of θ(0) and v(0). Let us define the basin of
attraction of a given attractor as the locus of all points in the θ(0)-v(0) plane which lead
to motion which ultimately converges on that attractor. We have seen that in the
low-amplitude (i.e., linear) limit (see Section 15.3) there is only a single period-1 attractor
in phase-space, and all possible initial conditions lead to motion which converges on this
attractor. In other words, the basin of attraction for the low-amplitude attractor constitutes
the entire θ(0)-v(0) plane. The present case, in which there are two co-existing attractors
in phase-space, is somewhat more complicated.

Figure 15.13 shows the basins of attraction, in θ(0)-v(0) space, of the asymmetric
mirror image attractors pictured in Figures 15.10 and 15.11. The basin of attraction of
the left-favouring attractor shown in Figure 15.10 is coloured black, whereas the basin of
attraction of the right-favouring attractor shown in Figure 15.11 is coloured white. It can be
seen that the two basins form a complicated interlocking pattern. Since we can identify the
angles π and −π, the right-hand edge of the pattern connects smoothly with its left-hand
edge. In fact, we can think of the pattern as existing on the surface of a cylinder.
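
A basin picture of this kind can be generated, in outline, by integrating from each point of a grid of initial conditions and recording which attractor the motion ends up on. The following is a minimal sketch rather than the procedure actually used for the figures: it assumes the normalized equations dθ/dt = v, dv/dt = -v/Q - sin θ + A cos(ωt), it labels an orbit as left-favouring when the time-asymptotic motion spends more than half of its time at θ < 0 (angles wrapped into [-π, π)), and the grid limits `v_min` and `v_max`, like all function and parameter names, are hypothetical.

```python
import math

def rhs(t, theta, v, Q, A, omega):
    # Assumed normalized equations of motion (not stated in this section).
    return v, -v / Q - math.sin(theta) + A * math.cos(omega * t)

def rk4_step(t, theta, v, h, Q, A, omega):
    # One classical fourth-order Runge-Kutta step of length h.
    k1t, k1v = rhs(t, theta, v, Q, A, omega)
    k2t, k2v = rhs(t + h / 2, theta + h / 2 * k1t, v + h / 2 * k1v, Q, A, omega)
    k3t, k3v = rhs(t + h / 2, theta + h / 2 * k2t, v + h / 2 * k2v, Q, A, omega)
    k4t, k4v = rhs(t + h, theta + h * k3t, v + h * k3v, Q, A, omega)
    return (theta + h / 6 * (k1t + 2 * k2t + 2 * k3t + k4t),
            v + h / 6 * (k1v + 2 * k2v + 2 * k3v + k4v))

def wrap_angle(theta):
    """Map an angle into the interval [-pi, pi)."""
    return (theta + math.pi) % (2 * math.pi) - math.pi

def classify(theta0, v0, Q=1.3, A=1.5, omega=2.0 / 3.0,
             n_transient=60, n_sample=4, steps_per_period=100):
    """Return -1 if the late-time motion is mostly at theta < 0, else +1."""
    h = 2 * math.pi / (omega * steps_per_period)
    theta, v = theta0, v0
    left = right = 0
    for step in range((n_transient + n_sample) * steps_per_period):
        theta, v = rk4_step(step * h, theta, v, h, Q, A, omega)
        if step >= n_transient * steps_per_period:  # transient discarded
            if wrap_angle(theta) < 0.0:
                left += 1
            else:
                right += 1
    return -1 if left > right else 1

def basin_grid(n_theta, n_v, v_min=-3.0, v_max=3.0, **params):
    """Classify an n_v by n_theta grid of initial conditions (n_v >= 2).
    theta(0) runs over [-pi, pi); pi itself is omitted because it is
    identified with -pi."""
    grid = []
    for j in range(n_v):
        v0 = v_min + (v_max - v_min) * j / (n_v - 1)
        row = [classify(-math.pi + 2 * math.pi * i / n_theta, v0, **params)
               for i in range(n_theta)]
        grid.append(row)
    return grid
```

Initial conditions that lie very close to the boundary between the two basins may need many more transient periods than the default used here before the motion settles, so the finest bands are the least reliable part of any such picture.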

Suppose that we take a diagonal from the bottom left-hand corner of Figure 15.13 
to its top right-hand corner. This diagonal is intersected by a number of black bands of 
varying thickness. Observe that the two narrowest bands (i.e., the fourth band from the 
bottom left-hand corner and the second band from the upper right-hand corner) both 
exhibit structure which is not very well resolved in the present picture. 

Figure 15.14 is a blow-up of a region close to the lower left-hand corner of Figure 15.13. 
It can be seen that the unresolved band in the latter figure (i.e., the second and third bands 
from the right-hand side in the former figure) actually consists of a closely spaced pair of 
bands. Note, however, that the narrower of these two bands exhibits structure which is 
not very well resolved in the present picture. 

Figure 15.15 is a blow-up of a region of Figure 15.14. It can be seen that the unresolved 
band in the latter figure (i.e., the first and second bands from the left-hand side in the 
former figure) actually consists of a closely spaced pair of bands. Note, however, that the 
broader of these two bands exhibits structure which is not very well resolved in the present 
picture. 

3 E.S. Abers and B.W. Lee, Phys. Rep. 9C, 1 (1973).

4 P. Coles and F. Lucchin, Cosmology: The origin and evolution of cosmic structure (J. Wiley & Sons,
Chichester UK, 1995).
Figure 15.13: The basins of attraction for the asymmetric, mirror image, attractors pictured
in Figures 15.10 and 15.11. Regions of θ(0)-v(0) space which lead to motion converging on
the left-favouring attractor shown in Figure 15.10 are coloured white: regions of θ(0)-v(0)
space which lead to motion converging on the right-favouring attractor shown in Figure 15.11
are coloured black. Data calculated numerically for Q = 1.3, A = 1.5, ω = 2/3, and φ = 0.

Figure 15.14: Detail of the basins of attraction for the asymmetric, mirror image, attractors
pictured in Figures 15.10 and 15.11. Regions of θ(0)-v(0) space which lead to motion
converging on the left-favouring attractor shown in Figure 15.10 are coloured white: regions of
θ(0)-v(0) space which lead to motion converging on the right-favouring attractor shown in
Figure 15.11 are coloured black. Data calculated numerically for Q = 1.3, A = 1.5, ω = 2/3,
and φ = 0.

Figure 15.15: Detail of the basins of attraction for the asymmetric, mirror image, attractors
pictured in Figures 15.10 and 15.11. Regions of θ(0)-v(0) space which lead to motion
converging on the left-favouring attractor shown in Figure 15.10 are coloured white: regions of
θ(0)-v(0) space which lead to motion converging on the right-favouring attractor shown in
Figure 15.11 are coloured black. Data calculated numerically for Q = 1.3, A = 1.5, ω = 2/3,
and φ = 0.

Figure 15.16: Detail of the basins of attraction for the asymmetric, mirror image, attractors
pictured in Figures 15.10 and 15.11. Regions of θ(0)-v(0) space which lead to motion
converging on the left-favouring attractor shown in Figure 15.10 are coloured white: regions of
θ(0)-v(0) space which lead to motion converging on the right-favouring attractor shown in
Figure 15.11 are coloured black. Data calculated numerically for Q = 1.3, A = 1.5, ω = 2/3,
and φ = 0.
Figure 15.16 is a blow-up of a region of Figure 15.15. It can be seen that the unresolved
band in the latter figure (i.e., the first, second, and third bands from the right-hand side
in the former figure) actually consists of a closely spaced triplet of bands. Note, however,
that the narrowest of these bands exhibits structure which is not very well resolved in the
present picture.

It should be clear, by this stage, that no matter how closely we look at Figure 15.13 we 
are going to find structure which we cannot resolve. In other words, the separatrix between 
the two basins of attraction shown in this figure is a curve which exhibits structure at all 
scales. Mathematicians have a special term for such a curve — they call it a fractal. 5 

Many people think of fractals as mathematical toys whose principal use is the genera- 
tion of pretty pictures. However, it turns out that there is a close connection between frac- 
tals and the dynamics of non-linear systems — particularly systems which exhibit chaotic 
dynamics. We have just seen an example in which the boundary between the basins of 
attraction of two co-existing attractors in phase-space is a fractal curve. This turns out to 
be a fairly general result: i.e., when multiple attractors exist in phase-space the separatrix 
between their various basins of attraction is invariably fractal. What is this telling us about 
the nature of non-linear dynamics? Well, returning to Figure 15.13, we can see that in the 
region of phase-space in which the fractal behaviour of the separatrix manifests itself most 
strongly (i.e., the region where the light and dark bands fragment) the system exhibits 
abnormal sensitivity to initial conditions. In other words, we only have to change the ini- 
tial conditions slightly (i.e., so as to move from a dark to a light band, or vice versa) in 
order to significantly alter the time-asymptotic motion of the pendulum (i.e., to cause the 
system to converge to a left-favouring instead of a right-favouring attractor, or vice versa) . 
Fractals and extreme sensitivity to initial conditions are themes which will recur in our
investigation of non-linear dynamics.

15.8 Period-Doubling Bifurcations 

Let us now return to Figure 15.8. Recall that, as the quality-factor Q is gradually increased,
the time-asymptotic orbit of the pendulum through phase-space undergoes a sudden
transition, at Q ≃ 1.245, from a left-right symmetric period-1 orbit to a left-favouring period-1
orbit. What happens if we continue to increase Q? Figure 15.17 is basically a continuation
of Figure 15.8. It can be seen that as Q is increased the left-favouring period-1 orbit
gradually evolves until a critical value of Q, which is about 1.348, is reached. When Q exceeds
this critical value the nature of the orbit undergoes another sudden change: this time from
a left-favouring period-1 orbit to a left-favouring period-2 orbit. Obviously, the change is
indicated by the forking of the curve in Figure 15.17. This type of transition is termed a
period-doubling bifurcation, since it involves a sudden doubling of the repetition period of
the pendulum's time-asymptotic motion.

We can represent period-1 motion schematically as AAAAAA···, where A represents
a pattern of motion which is repeated every period of the external drive. Likewise, we



5 B.B. Mandelbrot, The fractal geometry of nature, (W.H. Freeman, New York NY, 1982). 
Figure 15.17: The v-coordinate of the Poincaré section of a time-asymptotic orbit plotted
against the quality-factor Q. Data calculated numerically for A = 1.5, ω = 2/3, θ(0) = 0,
v(0) = 0, and φ = 0.

can represent period-2 motion as ABABAB···, where A and B represent distinguishable
patterns of motion which are repeated every alternate period of the external drive. A
period-doubling bifurcation is represented: AAAAAA··· → ABABAB···. Clearly, all
that happens in such a bifurcation is that the pendulum suddenly decides to do something
slightly different in alternate periods of the external drive.

Figure 15.18 shows the time-asymptotic phase-space orbit of the pendulum calculated
for a value of Q somewhat higher than that required to trigger the above-mentioned
period-doubling bifurcation. It can be seen that the orbit is left-favouring (i.e., it spends
the majority of its time on the left-hand side of the plot), and takes the form of a closed
curve consisting of two interlocked loops in phase-space. Recall that for period-1 orbits
there was only a single closed loop in phase-space. Figure 15.19 shows the Poincaré
section of the orbit shown in Figure 15.18. The fact that the section consists of two points
confirms that the orbit does indeed correspond to period-2 motion.
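
Counting the distinct points of a Poincaré section by eye can be replaced by a simple numerical check. The sketch below is one possible way to do this, not a method taken from the text: given a list of successive section points (θ, v), it looks for the smallest n such that every point coincides, within a tolerance, with the point n drive periods later. The function name, the tolerance, and the requirement of at least two full repetitions are illustrative assumptions, and the angles are assumed to be already wrapped into a single 2π interval.

```python
def detect_period(points, max_period=64, tol=1e-6):
    """Return the smallest n for which points[k + n] matches points[k] for
    every available k, or None if no such n is found.

    points     -- list of (theta, v) tuples, one per drive period
    max_period -- largest period that is tested
    tol        -- absolute tolerance used when comparing coordinates
    """
    for n in range(1, max_period + 1):
        if len(points) < 2 * n:
            break  # too few points to see two full repetitions
        repeats = all(abs(p[0] - q[0]) < tol and abs(p[1] - q[1]) < tol
                      for p, q in zip(points, points[n:]))
        if repeats:
            return n
    return None
```

For example, a section that alternates between two distinct points yields n = 2, whereas a section whose points never recur within the tested range yields `None`, which is only a hint of aperiodic motion and not a proof of it.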

A period-doubling bifurcation is an example of temporal symmetry breaking. The
equations of motion of the pendulum are invariant under the transformation t → t + τ, where
τ is the period of the external drive. In the low amplitude (i.e., linear) limit, the
time-asymptotic motion of the pendulum always respects this symmetry. However, as we have
just seen, in the non-linear regime it is possible to obtain solutions which spontaneously
break this symmetry. Obviously, motion which repeats itself every two periods of the
external drive is not as temporally symmetric as motion which repeats every period of the
drive.

Figure 15.20 is essentially a continuation of Figure 15.12. Data is shown for two sets of
initial conditions which lead to motions converging on left-favouring (lower branch) and
right-favouring (upper branch) periodic attractors. We have already seen that the
left-favouring periodic attractor undergoes a period-doubling bifurcation at Q = 1.348. It is
Figure 15.18: Equally spaced (in time) points on a time-asymptotic orbit in phase-space.
Data calculated numerically for Q = 1.36, A = 1.5, ω = 2/3, θ(0) = 0, and v(0) = −3.

Figure 15.19: The Poincaré section of a time-asymptotic orbit. Data calculated numerically
for Q = 1.36, A = 1.5, ω = 2/3, θ(0) = 0, v(0) = 0, and φ = 0.

Figure 15.20: The v-coordinate of the Poincaré section of a time-asymptotic orbit plotted
against the quality-factor Q. Data calculated numerically for A = 1.5, ω = 2/3, and φ = 0.
Data is shown for two sets of initial conditions: θ(0) = 0 and v(0) = 0 (lower branch); and
θ(0) = 0 and v(0) = −2 (upper branch).

clear from Figure 15.20 that the right-favouring attractor undergoes a similar bifurcation
at almost exactly the same Q-value. This is hardly surprising since, as has already been
mentioned, for fixed physical parameters (i.e., Q, A, ω), the left- and right-favouring
attractors are mirror-images of one another.

15.9 Route to Chaos 

Let us return to Figure 15.17, which tracks the evolution of a left-favouring periodic
attractor as the quality-factor Q is gradually increased. Recall that when Q exceeds a critical
value, which is about 1.348, then the attractor undergoes a period-doubling bifurcation
which converts it from a period-1 to a period-2 attractor. This bifurcation is indicated by
the forking of the curve in Figure 15.17. Let us now investigate what happens as we
continue to increase Q. Figure 15.21 is basically a continuation of Figure 15.17. It can be seen
that, as Q is gradually increased, the attractor undergoes a period-doubling bifurcation at
Q = 1.348, as before, but then undergoes a second period-doubling bifurcation (indicated
by the second forking of the curves) at Q ≃ 1.370, and a third bifurcation at Q ≃ 1.375.
Obviously, the second bifurcation converts a period-2 attractor into a period-4 attractor
(hence, two curves split apart to give four curves). Likewise, the third bifurcation converts
a period-4 attractor into a period-8 attractor (hence, four curves split into eight curves).
Shortly after the third bifurcation, the various curves in the figure seem to expand
explosively and merge together to produce an area of almost solid black. As we shall see, this
behaviour is indicative of the onset of chaos.

Figure 15.22 is a blow-up of Figure 15.21, showing more details of the onset of chaos. 
Figure 15.21: The v-coordinate of the Poincaré section of a time-asymptotic orbit plotted
against the quality-factor Q. Data calculated numerically for A = 1.5, ω = 2/3, θ(0) = 0,
v(0) = 0, and φ = 0.

The period-4 to period-8 bifurcation can be seen quite clearly. However, we can also see a
period-8 to period-16 bifurcation, at Q ≃ 1.3755. Finally, if we look carefully, we can see a
hint of a period-16 to period-32 bifurcation, just before the start of the solid black region.
Figures 15.21 and 15.22 seem to suggest that the onset of chaos is triggered by an infinite
series of period-doubling bifurcations.

Table 15.1 gives some details of the sequence of period-doubling bifurcations shown in
Figures 15.21 and 15.22. Let us introduce a bifurcation index n: the period-1 to period-2
bifurcation corresponds to n = 1; the period-2 to period-4 bifurcation corresponds to
n = 2; and so on. Let Q_n be the critical value of the quality-factor Q at which the nth
bifurcation is triggered. Table 15.1 shows the Q_n, determined from Figures 15.21 and
15.22, for n = 1 to 5. Also shown is the ratio

$$F_n = \frac{Q_{n-1} - Q_{n-2}}{Q_n - Q_{n-1}} \qquad (15.24)$$

for n = 3 to 5. It can be seen that Table 15.1 offers reasonably convincing evidence that
this ratio takes the constant value F = 4.69. It follows that we can estimate the critical
Q-value required to trigger the nth bifurcation via the following formula:

$$Q_n = Q_1 + (Q_2 - Q_1) \sum_{j=0}^{n-2} \frac{1}{F^{\,j}} \qquad (15.25)$$

for n > 1. Note that the distance (in Q) between bifurcations decreases rapidly as n
Figure 15.22: The v-coordinate of the Poincaré section of a time-asymptotic orbit plotted
against the quality-factor Q. Data calculated numerically for A = 1.5, ω = 2/3, θ(0) = 0,
v(0) = 0, and φ = 0.

increases. In fact, the above formula predicts an accumulation of period-doubling
bifurcations at Q = Q_∞, where

$$Q_\infty = Q_1 + (Q_2 - Q_1) \sum_{j=0}^{\infty} \frac{1}{F^{\,j}} = Q_1 + (Q_2 - Q_1)\, \frac{F}{F - 1} = 1.3758. \qquad (15.26)$$

Note that our calculated accumulation point corresponds almost exactly to the onset of
the solid black region in Figure 15.22. By the time that Q exceeds Q_∞, we expect the
attractor to have been converted into a period-infinity attractor via an infinite series of
period-doubling bifurcations. A period-infinity attractor is one whose corresponding
motion never repeats itself, no matter how long we wait. In dynamics, such bounded aperiodic
motion is generally referred to as chaos. Hence, a period-infinity attractor is sometimes
called a chaotic attractor. Now, period-n motion is represented by n separate curves in
Figure 15.22. It is, therefore, not surprising that chaos (i.e., period-infinity motion) is
represented by an infinite number of curves which merge together to form a region of solid
black.
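
The arithmetic behind these estimates is easy to check. If the ratio of successive gaps is a constant F, then the gaps Q_n − Q_{n−1} form a geometric sequence, which gives Q_n = Q_1 + (Q_2 − Q_1) Σ_{j=0}^{n−2} F^{−j} and, in the limit, the accumulation point Q_1 + (Q_2 − Q_1) F/(F − 1). The short sketch below evaluates these expressions using the Q_n values listed in Table 15.1; the function names are illustrative.

```python
# Critical quality-factors Q_n for n = 1..5, as listed in Table 15.1.
Q = {1: 1.34870, 2: 1.37003, 3: 1.37458, 4: 1.37555, 5: 1.37575}

def ratio(n):
    """Ratio of successive bifurcation gaps, (Q_{n-1} - Q_{n-2}) / (Q_n - Q_{n-1})."""
    return (Q[n - 1] - Q[n - 2]) / (Q[n] - Q[n - 1])

def predicted_Q(n, F=4.69):
    """Geometric-series estimate of Q_n for n > 1, assuming a constant ratio F."""
    return Q[1] + (Q[2] - Q[1]) * sum(F ** (-j) for j in range(n - 1))

def accumulation_point(F=4.69):
    """Limit of predicted_Q(n) as n tends to infinity (requires F > 1)."""
    return Q[1] + (Q[2] - Q[1]) * F / (F - 1)

if __name__ == "__main__":
    for n in (3, 4, 5):
        print(n, round(ratio(n), 3), round(predicted_Q(n), 5))
    print("accumulation point:", round(accumulation_point(), 4))
```

Because the tabulated gaps for the last bifurcations are only a few units in the fifth decimal place, the ratio for n = 5 is sensitive to rounding in Q_4 and Q_5, which is consistent with the larger uncertainty quoted for that entry in the table.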

Let us examine the onset of chaos in a little more detail. Figures 15.23-15.26 show
details of the pendulum's time-asymptotic motion at various stages on the period-doubling
cascade discussed above. Figure 15.23 shows period-4 motion: note that the Poincaré
section consists of four points, and the associated sequence of net rotations per period
of the pendulum repeats itself every four periods. Figure 15.24 shows period-8 motion:
now the Poincaré section consists of eight points, and the rotation sequence repeats itself
every eight periods. Figure 15.25 shows period-16 motion: as expected, the Poincaré
Bifurcation              n    Q_n        Q_n − Q_{n−1}    F_n
period-1 → period-2      1    1.34870    -                -
period-2 → period-4      2    1.37003    0.02133          -
period-4 → period-8      3    1.37458    0.00455          4.69 ± 0.01
period-8 → period-16     4    1.37555    0.00097          4.69 ± 0.04
period-16 → period-32    5    1.37575    0.00020          4.9 ± 0.20

Table 15.1: The period-doubling cascade.
Figure 15.23: The Poincaré section of a time-asymptotic orbit. Data calculated numerically
for Q = 1.372, A = 1.5, ω = 2/3, θ(0) = 0, v(0) = 0, and φ = 0. Also shown is the net
rotation per period, Δθ/2π, calculated at the Poincaré phase φ = 0.

section consists of sixteen points, and the rotation sequence repeats itself every sixteen
periods. Finally, Figure 15.26 shows chaotic motion. Note that the Poincaré section now
consists of a set of four continuous line segments, which are, presumably, made up of an
infinite number of points (corresponding to the infinite period of chaotic motion). Note,
also, that the associated sequence of net rotations per period shows no obvious sign of ever
repeating itself. In fact, this sequence looks rather like one of the previously shown periodic
sequences with the addition of a small random component. The generation of apparently
random motion from equations of motion, such as Equations (15.10) and (15.11), which
contain no overtly random elements is one of the most surprising features of non-linear
dynamics.

Many non-linear dynamical systems found in nature exhibit a transition from periodic
to chaotic motion as some control parameter is varied. Moreover, there are various known
mechanisms by which chaotic motion can arise from periodic motion. A transition to chaos
via an infinite series of period-doubling bifurcations, as illustrated above, is one of the most
commonly occurring mechanisms. Around 1975, the physicist Mitchell Feigenbaum was
investigating a simple mathematical model, known as the logistic map, which exhibits a
transition to chaos, via a sequence of period-doubling bifurcations, as a control parameter r
is increased. Let r_n be the value of r at which the first 2^n-period cycle appears. Feigenbaum







Figure 15.24: The Poincaré section of a time-asymptotic orbit. Data calculated numerically
for Q = 1.375, A = 1.5, ω = 2/3, θ(0) = 0, v(0) = 0, and φ = 0. Also shown is the net
rotation per period, Δθ/2π, calculated at the Poincaré phase φ = 0.

Figure 15.25: The Poincaré section of a time-asymptotic orbit. Data calculated numerically
for Q = 1.3757, A = 1.5, ω = 2/3, θ(0) = 0, v(0) = 0, and φ = 0. Also shown is the net
rotation per period, Δθ/2π, calculated at the Poincaré phase φ = 0.

Figure 15.26: The Poincaré section of a time-asymptotic orbit. Data calculated numerically
for Q = 1.376, A = 1.5, ω = 2/3, θ(0) = 0, v(0) = 0, and φ = 0. Also shown is the net
rotation per period, Δθ/2π, calculated at the Poincaré phase φ = 0.

noticed that the ratio

$$F_n = \frac{r_{n-1} - r_{n-2}}{r_n - r_{n-1}} \qquad (15.27)$$

converges rapidly to a constant value, F = 4.669, as n increases. Feigenbaum was able
to demonstrate that this value of F is common to a wide range of different mathematical
models which exhibit transitions to chaos via period-doubling bifurcations. 6 Feigenbaum
went on to argue that the Feigenbaum ratio, F_n, should converge to the value 4.669 in
any dynamical system exhibiting a transition to chaos via period-doubling bifurcations. 7
This amazing prediction has been verified experimentally in a number of quite different
physical systems. 8 Note that our best estimate of the Feigenbaum ratio (see Table 15.1) is
4.69 ± 0.01, in good agreement with Feigenbaum's prediction.
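
The period-doubling sequence of the logistic map is easy to observe numerically. The sketch below uses the standard form of the map, x → r x (1 − x), which is not written out in the text above; the function name, the starting value x = 0.5, the number of discarded iterations, and the tolerance are all illustrative choices. It reports the length of the cycle onto which the iteration settles for a given r.

```python
def logistic_cycle_length(r, n_transient=10000, max_period=64, tol=1e-9):
    """Iterate x -> r*x*(1 - x), discard a transient, and return the smallest n
    for which the orbit returns to its starting value within tol.
    Returns None if no cycle of length <= max_period is detected."""
    x = 0.5
    for _ in range(n_transient):
        x = r * x * (1.0 - x)
    x_ref = x  # reference point on the (presumed) attracting cycle
    for n in range(1, max_period + 1):
        x = r * x * (1.0 - x)
        if abs(x - x_ref) < tol:
            return n
    return None

if __name__ == "__main__":
    for r in (2.8, 3.2, 3.5, 3.55):
        print(r, logistic_cycle_length(r))
```

Sampling r on either side of each doubling gives cycles of length 1, 2, 4, and 8 in turn. Locating the thresholds r_n accurately enough to estimate the ratio (15.27) takes considerably more care, because convergence onto the cycle becomes very slow close to each bifurcation.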

The existence of a universal ratio characterizing the transition to chaos via period- 
doubling bifurcations is one of many pieces of evidence indicating that chaos is a universal 
phenomenon (i.e., the onset and nature of chaotic motion in different dynamical systems 
has many common features). This observation encourages us to believe that in studying 
the chaotic motion of a damped periodically driven pendulum we are learning lessons 
which can be applied to a wide range of non-linear dynamical systems. 



15.10 Sensitivity to Initial Conditions 

Suppose that we launch our pendulum and then wait until its motion has converged onto
a particular attractor. The subsequent motion can be visualized as a trajectory θ_0(t), v_0(t)

6 M.J. Feigenbaum, Quantitative universality for a class of nonlinear transformations, J. Stat. Phys. 19, 25
(1978).

7 M.J. Feigenbaum, The universal metric properties of nonlinear transformations, J. Stat. Phys. 21, 69
(1979).

8 P. Cvitanović, Universality in chaos (Adam Hilger, Bristol UK, 1989).
through phase-space. Suppose that we then somehow perturb the pendulum, at time
t = t_0, such that its position in phase-space is instantaneously changed from θ_0(t_0), v_0(t_0)
to θ_0(t_0) + δθ_0, v_0(t_0) + δv_0. The subsequent motion can be visualized as a second trajectory
θ_1(t), v_1(t) through phase-space. What is the relationship between the original trajectory
θ_0(t), v_0(t) and the perturbed trajectory θ_1(t), v_1(t)? In particular, does the phase-space
separation between the two trajectories, whose components are

$$\delta\theta(\Delta t) = \theta_1(t_0 + \Delta t) - \theta_0(t_0 + \Delta t), \qquad (15.28)$$

$$\delta v(\Delta t) = v_1(t_0 + \Delta t) - v_0(t_0 + \Delta t), \qquad (15.29)$$

grow in time, decay in time, or stay more or less the same? What we are really investigating
is how sensitive the time-asymptotic motion of the pendulum is to initial conditions.
According to the linear analysis of Section 15.3, 

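$$\delta\theta(\Delta t) = \delta\theta_0 \cos(\omega_* \Delta t)\, e^{-\Delta t/2Q} + \frac{1}{\omega_*} \left\{ \delta v_0 + \frac{\delta\theta_0}{2Q} \right\} \sin(\omega_* \Delta t)\, e^{-\Delta t/2Q}, \qquad (15.30)$$

$$\delta v(\Delta t) = \delta v_0 \cos(\omega_* \Delta t)\, e^{-\Delta t/2Q} - \frac{1}{\omega_*} \left\{ \delta\theta_0 + \frac{\delta v_0}{2Q} \right\} \sin(\omega_* \Delta t)\, e^{-\Delta t/2Q}, \qquad (15.31)$$

assuming sin(ω_* t_0) = 0. It is clear that, in the linear regime, at least, the pendulum's time-
asymptotic motion is not particularly sensitive to initial conditions. In fact, if we move the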
pendulum's phase-space trajectory slightly off the linear attractor, as described above, then 
the perturbed trajectory decays back to the attractor exponentially in time. In other words, 
if we wait long enough then the perturbed and unperturbed motions of the pendulum 
become effectively indistinguishable. Let us now investigate whether this insensitivity to 
initial conditions carries over into the non-linear regime. 

Figures 15.27-15.30 show the results of the experiment described above, in which the 
pendulum's phase-space trajectory is moved slightly off an attractor and the phase-space 
separation between the perturbed and unperturbed trajectories is then monitored as a 
function of time, at various stages on the period-doubling cascade discussed in the previous 
section. To be more exact, the figures show the logarithm of the absolute magnitude of 
the v-component of the phase-space separation between the perturbed and unperturbed 
trajectories as a function of normalized time. 

Figure 15.27 shows the time evolution of the v-component of the phase-space separation,
$\delta v$, between two neighbouring trajectories, one of which is the period-4 attractor
illustrated in Figure 15.23. It can be seen that $\delta v$ decays rapidly in time. In fact, the graph
of $\log(|\delta v|)$ versus $\Delta t$ can be plausibly represented as a straight-line of gradient $\lambda$. In other
words,

$$|\delta v(\Delta t)| \simeq \delta v_0\, e^{\lambda\,\Delta t},$$ (15.32)

where the quantity $\lambda$ is known as the Liapunov exponent, and is obviously negative. Clearly,
in this case, $\lambda$ measures the strength of the exponential convergence of the two trajectories
in phase-space. Of course, the graph of $\log(|\delta v|)$ versus $\Delta t$ is not exactly a straight-line.
There are deviations due to the fact that $\delta v$ oscillates, as well as decays, in time. There
are also deviations because the strength of the exponential convergence between the two
trajectories varies along the attractor.

Figure 15.27: The v-component of the separation between two neighbouring phase-space trajectories
(one of which lies on an attractor) plotted against normalized time. Data calculated
numerically for Q = 1.372, A = 1.5, $\omega = 2/3$, $\theta(0) = 0$, and v(0) = 0. The separation
between the two trajectories is initialized to $\delta\theta_0 = \delta v_0 = 10^{-6}$ at $\Delta t = 0$.

Figure 15.28: The v-component of the separation between two neighbouring phase-space trajectories
(one of which lies on an attractor) plotted against normalized time. Data calculated
numerically for Q = 1.375, A = 1.5, $\omega = 2/3$, $\theta(0) = 0$, and v(0) = 0. The separation
between the two trajectories is initialized to $\delta\theta_0 = \delta v_0 = 10^{-6}$ at $\Delta t = 0$.

Figure 15.29: The v-component of the separation between two neighbouring phase-space trajectories
(one of which lies on an attractor) plotted against normalized time. Data calculated
numerically for Q = 1.3757, A = 1.5, $\omega = 2/3$, $\theta(0) = 0$, and v(0) = 0. The separation
between the two trajectories is initialized to $\delta\theta_0 = \delta v_0 = 10^{-6}$ at $\Delta t = 0$.

Figure 15.30: The v-component of the separation between two neighbouring phase-space trajectories
(one of which lies on an attractor) plotted against normalized time. Data calculated
numerically for Q = 1.376, A = 1.5, $\omega = 2/3$, $\theta(0) = 0$, and v(0) = 0. The separation
between the two trajectories is initialized to $\delta\theta_0 = \delta v_0 = 10^{-6}$ at $\Delta t = 0$.
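
The straight-line fit described above can be sketched numerically. What follows is a minimal sketch under stated assumptions, not the calculation used to produce the figures: it assumes the normalized equations of motion dθ/dt = v, dv/dt = −v/Q − sin θ + A cos(ω t) for the driven pendulum, uses a fixed-step fourth-order Runge-Kutta integrator, and fits the logarithm of the full phase-space separation (rather than of its v-component alone) so that zeros of the oscillating separation do not spoil the fit. All function names, the step size, and the time windows are illustrative choices.

```python
import math


def rhs(t, theta, v, Q, A, omega, restoring):
    """Assumed normalized equations of motion of the damped, driven pendulum."""
    return v, -v / Q - restoring(theta) + A * math.cos(omega * t)


def rk4_step(t, theta, v, h, Q, A, omega, restoring):
    """Advance (theta, v) by one fourth-order Runge-Kutta step of size h."""
    k1t, k1v = rhs(t, theta, v, Q, A, omega, restoring)
    k2t, k2v = rhs(t + h / 2, theta + h * k1t / 2, v + h * k1v / 2, Q, A, omega, restoring)
    k3t, k3v = rhs(t + h / 2, theta + h * k2t / 2, v + h * k2v / 2, Q, A, omega, restoring)
    k4t, k4v = rhs(t + h, theta + h * k3t, v + h * k3v, Q, A, omega, restoring)
    theta += h * (k1t + 2 * k2t + 2 * k3t + k4t) / 6
    v += h * (k1v + 2 * k2v + 2 * k3v + k4v) / 6
    return theta, v


def separation_growth_rate(Q, A=1.5, omega=2.0 / 3.0, restoring=math.sin,
                           t_settle=200.0, t_fit=30.0, h=0.01, eps=1.0e-6):
    """Least-squares gradient of log(phase-space separation) versus elapsed time."""
    t, theta, v = 0.0, 0.0, 0.0
    for _ in range(int(round(t_settle / h))):      # let the transients die away
        theta, v = rk4_step(t, theta, v, h, Q, A, omega, restoring)
        t += h
    theta1, v1 = theta + eps, v + eps              # slightly perturbed trajectory
    times, logs = [], []
    for i in range(int(round(t_fit / h))):
        theta, v = rk4_step(t, theta, v, h, Q, A, omega, restoring)
        theta1, v1 = rk4_step(t, theta1, v1, h, Q, A, omega, restoring)
        t += h
        times.append((i + 1) * h)
        logs.append(math.log(math.hypot(theta1 - theta, v1 - v)))
    n = len(times)
    t_mean = sum(times) / n
    y_mean = sum(logs) / n
    num = sum((x - t_mean) * (y - y_mean) for x, y in zip(times, logs))
    den = sum((x - t_mean) ** 2 for x in times)
    return num / den
```

With the default `restoring=math.sin`, a negative return value indicates converging trajectories and a positive one indicates diverging trajectories. Passing `restoring=lambda x: x` recovers the linear regime, where the fitted gradient should lie close to −1/(2Q), the decay rate of the envelope in Equations (15.30)-(15.31).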

The above definition of the Liapunov exponent is rather inexact, for two main rea- 
sons. In the first place, the strength of the exponential convergence/divergence between 
two neighbouring trajectories in phase-space, one of which is an attractor, generally varies 
along the attractor. Hence, we should really take formula (15.32) and somehow average 
it over the attractor, in order to obtain a more unambiguous definition of $\lambda$. In the second
place, since the dynamical system under investigation is a second-order system, it actually
possesses two different Liapunov exponents. Consider the evolution of an infinitesimal circle
of perturbed initial conditions, centred on a point in phase-space lying on an attractor.
During its evolution, the circle will become distorted into an infinitesimal ellipse. Let $\delta_k$,
where k = 1, 2, denote the phase-space length of the kth principal axis of the ellipse. The
two Liapunov exponents, $\lambda_1$ and $\lambda_2$, are defined via $\delta_k(\Delta t) \simeq \delta_k(0) \exp(\lambda_k\,\Delta t)$. However,
for large $\Delta t$, the diameter of the ellipse is effectively controlled by the Liapunov exponent
with the most positive real part. Hence, when we refer to the Liapunov exponent, $\lambda$, what
we generally mean is the Liapunov exponent with the most positive real part.

Figure 15.28 shows the time evolution of the v-component of the phase-space separation,
$\delta v$, between two neighbouring trajectories, one of which is the period-8 attractor
illustrated in Figure 15.24. It can be seen that $\delta v$ decays in time, though not as rapidly as
in Figure 15.27. Another way of saying this is that the Liapunov exponent of the periodic 
attractor shown in Figure 15.24 is negative (i.e., it has a negative real part), but not as 
negative as that of the periodic attractor shown in Figure 15.23. 

Figure 15.29 shows the time evolution of the v-component of the phase-space separation,
$\delta v$, between two neighbouring trajectories, one of which is the period-16 attractor
illustrated in Figure 15.25. It can be seen that $\delta v$ decays weakly in time. In other words,
the Liapunov exponent of the periodic attractor shown in Figure 15.25 is small and nega- 
tive. 

Finally, Figure 15.30 shows the time evolution of the v-component of the phase-space 
separation, $\delta v$, between two neighbouring trajectories, one of which is the chaotic attractor
illustrated in Figure 15.26. It can be seen that $\delta v$ increases in time. In other words, the
Liapunov exponent of the chaotic attractor shown in Figure 15.26 is positive. Further 
investigation reveals that, as the control parameter Q is gradually increased, the Liapunov 
exponent changes sign and becomes positive at exactly the same point that chaos ensues 
in Figure 15.22. 

The above discussion strongly suggests that periodic attractors are characterized by 
negative Liapunov exponents, whereas chaotic attractors are characterized by positive ex- 
ponents. But, how can an attractor have a positive Liapunov exponent? Surely, a positive 
exponent necessarily implies that neighbouring phase-space trajectories diverge from the 
attractor (and, hence, that the attractor is not a true attractor)? It turns out that this is not 
the case. The chaotic attractor shown in Figure 15.26 is a true attractor, in the sense that 
neighbouring trajectories rapidly converge onto it: i.e., after a few periods of the external
drive their Poincaré sections plot out the same four-line segment shown in Figure 15.26.
Thus, the exponential divergence of neighbouring trajectories, characteristic of chaotic at- 
tractors, takes place within the attractor itself. Obviously, this exponential divergence must 
come to an end when the phase-space separation of the trajectories becomes comparable 
to the extent of the attractor. 

A dynamical system characterized by a positive Liapunov exponent, $\lambda$, has a time horizon
beyond which regular deterministic prediction breaks down. Suppose that we measure
the initial conditions of an experimental system very accurately. Obviously, no measurement
is perfect: there is always some error $\delta_0$ between our estimate and the true initial
state. After a time t, the discrepancy grows to $\delta(t) \simeq \delta_0 \exp(\lambda\, t)$. Let a be a measure of our
tolerance: i.e., a prediction within a of the true state is considered acceptable. It follows
that our prediction becomes unacceptable when $\delta > a$, which occurs when

$$t > \frac{1}{\lambda} \ln\!\left(\frac{a}{\delta_0}\right).$$ (15.33)

Note the logarithmic dependence on $\delta_0$. This ensures that, in practice, no matter how hard
we work to reduce our initial measurement error, we cannot predict the behaviour of the
system for longer than a few multiples of $1/\lambda$.
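
The weak, logarithmic pay-off from improving the initial measurement can be made concrete in a few lines of Python. This is only an illustrative sketch: the function name `prediction_horizon` and the sample numbers are assumptions rather than values taken from the text, and the formula is simply the condition that the error grown from the initial error reaches the tolerance, solved for the time.

```python
import math


def prediction_horizon(tolerance, initial_error, liapunov_exponent):
    """Time at which initial_error * exp(liapunov_exponent * t) reaches tolerance."""
    if liapunov_exponent <= 0.0:
        raise ValueError("a finite horizon requires a positive Liapunov exponent")
    return math.log(tolerance / initial_error) / liapunov_exponent


# Hypothetical numbers: tolerance 1e-3 and exponent 1, so times are in units of 1/lambda.
coarse = prediction_horizon(1.0e-3, 1.0e-7, 1.0)    # initial error 1e-7
fine = prediction_horizon(1.0e-3, 1.0e-13, 1.0)     # initial error a million times smaller
print(coarse, fine, fine / coarse)
```

In this made-up example, a million-fold reduction in the initial error lengthens the prediction horizon by only a factor of 2.5.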

It follows, from the above discussion, that chaotic attractors are associated with motion 
which is essentially unpredictable. In other words, if we attempt to integrate the equations 
of motion of a chaotic system then even the slightest error made in the initial conditions 
will be amplified exponentially over time, and will rapidly destroy the accuracy of our 
prediction. Eventually, all that we will be able to say is that the motion lies somewhere on 
the chaotic attractor in phase-space, but exactly where it lies on the attractor at any given 
time will be unknown to us. 

The hyper-sensitivity of chaotic systems to initial conditions is sometimes called the but- 
terfly effect. The idea is that a butterfly flapping its wings in a South American rain-forest 
could, in principle, affect the weather in Texas (since the atmosphere exhibits chaotic 
dynamics). This idea was first publicized by the meteorologist Edward Lorenz, who constructed
a very crude model of the convection of the atmosphere when it is heated from
below by the ground. 9 Lorenz discovered, much to his surprise, that his model atmosphere 
exhibited chaotic motion — which, at that time, was virtually unknown. In fact, Lorenz was 
essentially the first scientist to fully understand the nature and ramifications of chaotic 
motion in physical systems. In particular, Lorenz realized that the chaotic dynamics of the 
atmosphere spells the doom of long-term weather forecasting: the best one can hope to 
achieve is to predict the weather a few days in advance ($1/\lambda$ for the atmosphere is of order
a few days).



9 E. Lorenz, Deterministic nonperiodic flow, J. Atmospheric Sciences 20, 130 (1963).




15.11 Definition of Chaos

There is no universally agreed definition of chaos. However, most people would accept the 
following working definition: 

Chaos is aperiodic time-asymptotic behaviour in a deterministic system which 
exhibits sensitive dependence on initial conditions. 

This definition contains three main elements: 

1. Aperiodic time-asymptotic behaviour — this implies the existence of phase-space tra- 
jectories which do not settle down to fixed points or periodic orbits. For practical 
reasons, we insist that these trajectories are not too rare. We also require the trajec- 
tories to be bounded: i.e., they should not go off to infinity. 

2. Deterministic — this implies that the equations of motion of the system possess no 
random inputs. In other words, the irregular behaviour of the system arises from 
non-linear dynamics, and not from noisy driving forces. 

3. Sensitive dependence on initial conditions — this implies that nearby trajectories in 
phase-space separate exponentially fast in time: i.e., that the system has a positive 
Liapunov exponent. 

15.12 Periodic Windows 

Let us return to Figure 15.22. Recall that this figure shows the onset of chaos, via a
cascade of period-doubling bifurcations, as the quality-factor Q is gradually increased. 
Figure 15.31 is essentially a continuation of Figure 15.22 which shows the full extent of 
the chaotic region (in Q-v space). It can be seen that the chaotic region ends abruptly 
when Q exceeds a critical value, which is about 1.4215. Beyond this critical value, the 
time-asymptotic motion appears to revert to period-1 motion (i.e., the solid black region
collapses to a single curve). It can also be seen that the chaotic region contains many 
narrow windows in which chaos reverts to periodic motion (i.e., the solid black region 
collapses to n curves, where n is the period of the motion) for a short interval in Q. The 
four widest windows are indicated in the figure. 

Figure 15.32 is a blow-up of the period-3 window shown in Figure 15.31. It can be 
seen that the window appears "out of the blue" as Q is gradually increased. However, it 
can also be seen that, as Q is further increased, the window breaks down, and eventually 
disappears, due to the action of a cascade of period-doubling bifurcations. The same 
basic mechanism operates here as in the original period-doubling cascade, discussed in 
Section 15.9, except that now the orbits are of period $3 \cdot 2^n$, instead of $2 \cdot 2^n$. Note that all
of the other periodic windows seen in Figure 15.31 break down in an analogous manner, 
as Q is increased. 
Figure 15.31: The v-coordinate of the Poincaré section of a time-asymptotic orbit plotted
against the quality-factor Q. Data calculated numerically for A = 1.5, $\omega = 2/3$, $\theta(0) = 0$,
v(0) = -0.75, and $\phi = 0$.

Figure 15.32: The v-coordinate of the Poincaré section of a time-asymptotic orbit plotted
against the quality-factor Q. Data calculated numerically for A = 1.5, $\omega = 2/3$, $\theta(0) = 0$,
v(0) = -0.75, and $\phi = 0$.

Figure 15.33: The Poincaré section of a time-asymptotic orbit. Data calculated numerically
for Q = 1.387976, A = 1.5, $\omega = 2/3$, $\theta(0) = 0$, v(0) = -0.75, and $\phi = 0$. Also shown is the
net rotation per period, $\Delta\theta/2\pi$, calculated at the Poincaré phase $\phi = 0$.

Figure 15.34: The Poincaré section of a time-asymptotic orbit. Data calculated numerically
for Q = 1.387977, A = 1.5, $\omega = 2/3$, $\theta(0) = 0$, v(0) = -0.75, and $\phi = 0$. Also shown is the
net rotation per period, $\Delta\theta/2\pi$, calculated at the Poincaré phase $\phi = 0$.

Figure 15.35: The Poincaré section of a time-asymptotic orbit. Data calculated numerically
for Q = 1.387978, A = 1.5, $\omega = 2/3$, $\theta(0) = 0$, v(0) = -0.75, and $\phi = 0$. Also shown is the
net rotation per period, $\Delta\theta/2\pi$, calculated at the Poincaré phase $\phi = 0$.

We now understand how periodic windows break down. But, how do they appear in the 
first place? Figures 15.33-15.35 show details of the pendulum's time-asymptotic motion 
calculated just before the appearance of the period-3 window (shown in Figure 15.32), just 
at the appearance of the window, and just after the appearance of the window, respectively. 
It can be seen, from Figure 15.33, that just before the appearance of the window the 
attractor is chaotic (i.e., its Poincaré section consists of a line, rather than a discrete set of
points), and the time-asymptotic motion of the pendulum consists of intervals of period-3
motion interspersed with intervals of chaotic motion. Figure 15.34 shows that just at the
appearance of the window the attractor loses much of its chaotic nature (i.e., its Poincaré
section breaks up into a series of points), and the chaotic intervals become shorter and 
much less frequent. Finally, Figure 15.35 shows that just after the appearance of the 
window the attractor collapses to a period-3 attractor, and the chaotic intervals cease 
altogether. All of the other periodic windows seen in Figure 15.31 appear in an analogous 
manner to that just described. 

According to the above discussion, the typical time-asymptotic motion seen just prior to 
the appearance of a period-n window consists of intervals of period-n motion interspersed 
with intervals of chaotic motion. This type of behaviour is called intermittency, and is
observed in a wide variety of non-linear systems. As we move away from the window, 
in parameter space, the intervals of periodic motion become gradually shorter and more 
infrequent. Eventually, they cease altogether. Likewise, as we move towards the window, 
the intervals of periodic motion become gradually longer and more frequent. Eventually, 
the whole motion becomes periodic. 

In 1973, Metropolis and co-workers investigated a class of simple mathematical models 
which all exhibit a transition to chaos, via a cascade of period-doubling bifurcations, as 
some control parameter r is increased. 10 They were able to demonstrate that, for these 

10 N. Metropolis, M.L. Stein, and P.R. Stein, On finite limit sets for transformations on the unit interval, J. 
Combin. Theor. 15, 25 (1973). 
models, the order in which stable periodic orbits occur as r is increased is fixed. That is,
stable periodic attractors always occur in the same sequence as r is varied. This sequence
is called the universal or U-sequence. It is possible to make a fairly convincing argument
that any physical system which exhibits a transition to chaos via a sequence of period- 
doubling bifurcations should also exhibit the U-sequence of stable periodic attractors. Up 
to period-6, the U-sequence is 

1, 2, 2 × 2, 6, 5, 3, 2 × 3, 5, 6, 4, 6, 5, 6.

The beginning of this sequence is familiar: periods 1, 2, 2 × 2 are the first stages of the
period-doubling cascade. (The later period-doublings give rise to periods greater than 6,
and so are omitted here.) The next periods, 6, 5, 3, correspond to the first three periodic
windows shown in Figure 15.31. Period 2 × 3 is the first component of the period-doubling
cascade which breaks up the period-3 window. The next period, 5, corresponds to the last 
periodic window shown in Figure 15.31. The remaining periods, 6, 4, 6, 5, 6, correspond 
to tiny periodic windows, which, in practice, are virtually impossible to observe. It follows 
that our driven pendulum system exhibits the U-sequence of stable periodic orbits fairly 
convincingly. This sequence has also been observed experimentally in other, quite differ- 
ent, dynamical systems. 11 The existence of a universal sequence of stable periodic orbits 
in dynamical systems which exhibit a transition to chaos via a cascade of period-doubling 
bifurcations is another indication that chaos is a universal phenomenon. 
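
The ordering of periodic attractors can be explored numerically in a very simple one-dimensional map on the unit interval. The sketch below uses the logistic map x → r x (1 − x) as a stand-in; that choice is an assumption of this example, and the map is not the driven pendulum. The function name, iteration counts, and tolerance are likewise illustrative. The code iterates away the transients and then reports the smallest period at which the orbit repeats.

```python
def attractor_period(r, max_period=64, n_transient=20000, tol=1.0e-9):
    """Smallest period p <= max_period of the logistic-map attractor, or None."""
    x = 0.5                                   # start from the map's critical point
    for _ in range(n_transient):              # discard the transients
        x = r * x * (1.0 - x)
    orbit = [x]
    for _ in range(2 * max_period):           # record a stretch of the settled orbit
        orbit.append(r * orbit[-1] * (1.0 - orbit[-1]))
    for p in range(1, max_period + 1):
        if all(abs(orbit[i + p] - orbit[i]) < tol for i in range(max_period)):
            return p
    return None                               # no short period found


for r in (2.8, 3.2, 3.5, 3.83):
    print(r, attractor_period(r))
```

The first three values of r lie on the period-doubling cascade of this map (periods 1, 2, and 2 × 2), whereas the last lies inside its period-3 window, in the same order as the corresponding entries of the U-sequence.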

15.13 Further Investigation 

Figure 15.36 shows the complete bifurcation diagram for the damped, periodically driven, 
pendulum (with A = 1.5 and $\omega = 2/3$). It can be seen that the chaotic region investigated
in the previous section is, in fact, the first, and least extensive, of three different chaotic 
regions. 

The interval between the first and second chaotic regions is occupied by the period-1
orbits shown in Figure 15.37. Note that these orbits differ somewhat from previously
encountered period-1 orbits in that the pendulum executes a complete rotation (either to
the left or to the right) every period of the external drive. Now, an n, l periodic orbit is
defined such that

$$\theta(t + n\,\tau) = \theta(t) + 2\pi\, l$$

for all t (after the transients have died away). It follows that all of the periodic orbits
which we encountered in previous sections were l = 0 orbits: i.e., their associated motions
did not involve a net rotation of the pendulum. The orbits shown in Figure 15.37 are
n = 1, l = −1 and n = 1, l = +1 orbits, respectively. The existence of periodic orbits in
which the pendulum undergoes a net rotation, either to the left or to the right, is another 
example of spatial symmetry breaking — there is nothing in the pendulum's equations of 
motion which distinguishes between the two possible directions of rotation. 
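
The definition of an n, l orbit suggests a simple numerical test: sample θ once per period of the external drive, and look for the smallest n at which θ advances by a whole number of turns. The sketch below is a hypothetical helper (its name, arguments, and tolerance are assumptions), and it is applied to synthetic samples rather than to an integrated pendulum orbit.

```python
import math


def classify_orbit(theta_samples, max_n=8, tol=1.0e-6):
    """Return (n, l) with theta[k + n] - theta[k] = 2*pi*l for every k, or None.

    theta_samples holds theta recorded once per period of the external drive,
    after the transients have died away.
    """
    for n in range(1, max_n + 1):
        if len(theta_samples) <= n:
            break
        turns = [(theta_samples[k + n] - theta_samples[k]) / (2.0 * math.pi)
                 for k in range(len(theta_samples) - n)]
        ell = round(turns[0])                 # candidate net rotation number
        if all(abs(w - ell) < tol for w in turns):
            return n, ell
    return None


# Synthetic samples: one complete backward rotation per drive period ...
rotating = [0.3 - 2.0 * math.pi * k for k in range(10)]
# ... and a period-2 orbit with no net rotation.
librating = [0.3, -0.4] * 5
print(classify_orbit(rotating), classify_orbit(librating))
```

The first synthetic sequence is classified as an n = 1, l = −1 orbit and the second as an n = 2, l = 0 orbit.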

11 R.H. Simoyi, A. Wolf, and H.L. Swinney, One-dimensional dynamics in a multi-component chemical
reaction, Phys. Rev. Lett. 49, 245 (1982).
Figure 15.36: The v-coordinate of the Poincaré section of a time-asymptotic orbit plotted
against the quality-factor Q. Data calculated numerically for A = 1.5, $\omega = 2/3$, $\theta(0) = 0$,
v(0) = 0, and $\phi = 0$.

Figure 15.37: Equally spaced (in time) points on a time-asymptotic orbit in phase-space.
Data calculated numerically for Q = 1.5, A = 1.5, $\omega = 2/3$, $\theta(0) = 0$, and v(0) = 0. Also
shown is the time-asymptotic orbit calculated for the modified initial conditions $\theta(0) = 0$, and
v(0) = -1.

Figure 15.38: The Poincaré section of a time-asymptotic orbit. Data calculated numerically
for Q = 2.13, A = 1.5, $\omega = 2/3$, $\theta(0) = 0$, v(0) = 0, and $\phi = 0$.

Figure 15.38 shows the Poincaré section of a typical attractor in the second chaotic
region shown in Figure 15.36. It can be seen that this attractor is far more convoluted 
and extensive than the simple 4-line chaotic attractor pictured in Figure 15.26. In fact, 
the attractor shown in Figure 15.38 is clearly a fractal curve. It turns out that virtually all 
chaotic attractors exhibit fractal structure. 

The interval between the second and third chaotic regions shown in Figure 15.36 is
occupied by n = 3, l = 0 periodic orbits. Figure 15.39 shows the Poincaré section of a
typical attractor in the third chaotic region. It can be seen that this attractor is even more
overtly fractal in nature than that pictured in the previous figure. Note that the fractal
nature of chaotic attractors is closely associated with some of their unusual properties.
Trajectories on a chaotic attractor remain confined to a bounded region of phase-space,
and yet they separate from their neighbours exponentially fast (at least, initially). How
can trajectories diverge endlessly and still stay bounded? The basic mechanism is described 
below. If we imagine a blob of initial conditions in phase-space then these undergo a series 
of repeated stretching and folding episodes, as the chaotic motion unfolds. The stretching is 
what gives rise to the divergence of neighbouring trajectories. The folding is what ensures 
that the trajectories remain bounded. The net result is a phase-space structure which looks 
a bit like filo pastry — in other words, a fractal structure. 
Figure 15.39: The Poincaré section of a time-asymptotic orbit. Data calculated numerically
for Q = 3.9, A = 1.5, $\omega = 2/3$, $\theta(0) = 0$, v(0) = 0, and $\phi = 0$.






A Vector Algebra and Vector Calculus 



A.1 Introduction

This appendix contains a brief outline of vector algebra and vector calculus. The essential 
purpose of vector algebra is to convert the propositions of Euclidean geometry in three- 
dimensional space into a convenient algebraic form. Vector calculus allows us to define the 
instantaneous velocity and acceleration of a moving object in three-dimensional space, as 
well as the work done when such an object travels along a general curved trajectory in a 
force field. Vector calculus also introduces the concept of a scalar field: e.g., the potential 
energy associated with a conservative force field. 



A.2 Scalars and Vectors

Many physical quantities (e.g., mass, energy) are entirely defined by a numerical mag- 
nitude (in appropriate units). Such quantities, which have no directional element, are 
known as scalars. Moreover, since scalars can be represented by real numbers, it follows 
that they obey the familiar laws of ordinary algebra. However, there exists a second class
of physical quantities (e.g., velocity, acceleration, force) which are only completely defined
when both a numerical magnitude and a direction in space are specified. Such quantities
are known as vectors. By definition, a vector obeys the same algebra as a displacement
in space, and may thus be represented geometrically by a straight-line, $\overrightarrow{PQ}$ (say), where
the arrow indicates the direction of the displacement (i.e., from point P to point Q): see
Figure A.1. The magnitude of the vector is represented by the length of the straight-line.

It is conventional to denote vectors by bold-faced symbols (e.g., a, F) and scalars by 
non-bold-faced symbols (e.g., r, S). The magnitude of a general vector, a, is denoted |a|, 
or just a, and is, by definition, always greater than or equal to zero. It is convenient to 
define a vector with zero magnitude — this is denoted 0, and has no direction. Finally, two 
vectors, a and b, are said to be equal when their magnitudes and directions are identical. 

Figure A.1: A vector.

Figure A.2: Vector addition.



A.3 Vector Algebra

Suppose that the displacements $\overrightarrow{PQ}$ and $\overrightarrow{QR}$ represent the vectors a and b, respectively
(see Figure A.2). It can be seen that the result of combining these two displacements is to
give the net displacement $\overrightarrow{PR}$. Hence, if $\overrightarrow{PR}$ represents the vector c then we can write

c = a + b. (A.1)

This defines vector addition. By completing the parallelogram PQRS, we can also see that

$$\overrightarrow{PR} = \overrightarrow{PQ} + \overrightarrow{QR} = \overrightarrow{PS} + \overrightarrow{SR}.$$ (A.2)

However, $\overrightarrow{PS}$ has the same length and direction as $\overrightarrow{QR}$, and, thus, represents the same
vector, b. Likewise, $\overrightarrow{PQ}$ and $\overrightarrow{SR}$ both represent the vector a. Thus, the above equation is
equivalent to

c = a + b = b + a. (A.3) 

We conclude that the addition of vectors is commutative. It can also be shown that the 
associative law holds: i.e., 

a+ (b + c) = (a + b) + c. (A.4) 

The null vector, 0, is represented by a displacement of zero length and arbitrary direc- 
tion. Since the result of combining such a displacement with a finite length displacement 
is the same as the latter displacement by itself, it follows that 

a + 0 = a, (A.5)

where a is a general vector. The negative of a is defined as that vector which has the same
magnitude, but acts in the opposite direction, and is denoted −a. The sum of a and −a is
thus the null vector: i.e., 

a+(-a)=0. (A.6) 
Figure A.3: Vector subtraction.



We can also define the difference of two vectors, a and b, as 

c = a − b = a + (−b). (A.7)

This definition of vector subtraction is illustrated in Figure A.3. 

If n > 0 is a scalar then the expression na denotes a vector whose direction is the
same as a, and whose magnitude is n times that of a. (This definition becomes obvious
when n is an integer.) If n is negative then, since na = |n| (−a), it follows that na is a
vector whose magnitude is |n| times that of a, and whose direction is opposite to a. These
definitions imply that if n and m are two scalars then

n(ma) = nm a = m(na), (A.8)
(n + m)a = na + ma, (A.9) 
n(a + b) = na + nb. (A.10) 



A.4 Cartesian Components of a Vector 

Consider a Cartesian coordinate system Oxyz consisting of an origin, O, and three mutually
perpendicular coordinate axes, Ox, Oy, and Oz (see Figure A.4). Such a system is said
to be right-handed if, when looking along the Oz direction, a 90° clockwise rotation about
Oz is required to take Ox into Oy. Otherwise, it is said to be left-handed. In physics, it is
conventional to always use right-handed coordinate systems.

It is convenient to define unit vectors, $\mathbf{e}_x$, $\mathbf{e}_y$, and $\mathbf{e}_z$, parallel to Ox, Oy, and Oz,
respectively. Incidentally, a unit vector is a vector whose magnitude is unity. The position
vector, r, of some general point P whose Cartesian coordinates are (x, y, z) is then given
by

$$\mathbf{r} = x\,\mathbf{e}_x + y\,\mathbf{e}_y + z\,\mathbf{e}_z.$$ (A.11)

In other words, we can get from O to P by moving a distance x parallel to Ox, then
a distance y parallel to Oy, and then a distance z parallel to Oz. Similarly, if a is an
arbitrary vector then

$$\mathbf{a} = a_x\,\mathbf{e}_x + a_y\,\mathbf{e}_y + a_z\,\mathbf{e}_z,$$ (A.12)

where $a_x$, $a_y$, and $a_z$ are termed the Cartesian components of a. It is conventional to write
$\mathbf{a} = (a_x, a_y, a_z)$. It follows that $\mathbf{e}_x = (1, 0, 0)$, $\mathbf{e}_y = (0, 1, 0)$, and $\mathbf{e}_z = (0, 0, 1)$. Of course,
0 = (0, 0, 0).

Figure A.4: A right-handed Cartesian coordinate system.

According to the three-dimensional generalization of the Pythagorean theorem, the 
distance OP = |r| = r is given by 

$$r = \sqrt{x^2 + y^2 + z^2}.$$ (A.13)

By analogy, the magnitude of a general vector a takes the form

$$a = \sqrt{a_x^2 + a_y^2 + a_z^2}.$$ (A.14)

If $\mathbf{a} = (a_x, a_y, a_z)$ and $\mathbf{b} = (b_x, b_y, b_z)$ then it is easily demonstrated that

$$\mathbf{a} + \mathbf{b} = (a_x + b_x,\, a_y + b_y,\, a_z + b_z).$$ (A.15)

Furthermore, if n is a scalar then it is apparent that

$$n\,\mathbf{a} = (n\,a_x,\, n\,a_y,\, n\,a_z).$$ (A.16)
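
Equations (A.14)-(A.16) translate directly into code when a vector is stored as a tuple of its three Cartesian components. The helper names and sample values below are illustrative assumptions, not part of the text.

```python
import math


def magnitude(a):
    """Magnitude from the three-dimensional Pythagorean theorem, Equation (A.14)."""
    return math.sqrt(a[0] ** 2 + a[1] ** 2 + a[2] ** 2)


def add(a, b):
    """Component-wise vector addition, Equation (A.15)."""
    return (a[0] + b[0], a[1] + b[1], a[2] + b[2])


def scale(n, a):
    """Multiplication of a vector by a scalar, Equation (A.16)."""
    return (n * a[0], n * a[1], n * a[2])


a = (3.0, 4.0, 12.0)
b = (1.0, -2.0, 0.5)
print(magnitude(a))                 # sqrt(9 + 16 + 144)
print(add(a, b) == add(b, a))       # addition is commutative
print(magnitude(scale(-2.0, a)))    # |n| times the magnitude of a
```

Scaling by the negative scalar −2 doubles the magnitude, in agreement with the rule that na has magnitude |n| times that of a.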

A.5 Coordinate Transformations

A Cartesian coordinate system allows position and direction in space to be represented in a 
very convenient manner. Unfortunately, such a coordinate system also introduces arbitrary 
elements into our analysis. After all, two independent observers might well choose coor- 
dinate systems with different origins, and different orientations of the coordinate axes. In 
general, a given vector a will have different sets of components in these two coordinate 
systems. However, the direction and magnitude of a are the same in both cases. Hence, 
the two sets of components must be related to one another in a very particular fashion. Actually,
since vectors are represented by moveable line elements in space (i.e., in Figure A.2,
$\overrightarrow{PQ}$ and $\overrightarrow{SR}$ represent the same vector), it follows that the components of a general vector
are not affected by a simple shift in the origin of a Cartesian coordinate system. On the
other hand, the components are modified when the coordinate axes are rotated.

Figure A.5: Rotation of the coordinate axes about Oz.

Suppose that we transform to a new coordinate system, Ox'y'z', which has the same
origin as Oxyz, and is obtained by rotating the coordinate axes of Oxyz through an angle
$\theta$ about Oz (see Figure A.5). Let the coordinates of a general point P be (x, y, z) in Oxyz
and (x', y', z') in Ox'y'z'. According to simple trigonometry, these two sets of coordinates
are related to one another via the transformation:

$$x' = x \cos\theta + y \sin\theta,$$ (A.17)
$$y' = -x \sin\theta + y \cos\theta,$$ (A.18)
$$z' = z.$$ (A.19)

Consider the vector displacement $\mathbf{r} = \overrightarrow{OP}$. Note that this displacement is represented by
the same symbol, r, in both coordinate systems, since the magnitude and direction of r are
manifestly independent of the orientation of the coordinate axes. The coordinates of r do
depend on the orientation of the axes: i.e., r = (x, y, z) in Oxyz, and r = (x', y', z') in
Ox'y'z'. However, they must depend in a very specific manner [i.e., Equations (A.17)-(A.19)]
which preserves the magnitude and direction of r.

The components of a general vector a transform in an analogous manner to
Equations (A.17)-(A.19): i.e.,

$$a_{x'} = a_x \cos\theta + a_y \sin\theta,$$ (A.20)
$$a_{y'} = -a_x \sin\theta + a_y \cos\theta,$$ (A.21)
$$a_{z'} = a_z.$$ (A.22)

Moreover, there are similar transformation rules for rotation about Ox and Oy.
Equations (A.20)-(A.22) effectively constitute the definition of a vector: i.e., the three quantities
$(a_x, a_y, a_z)$ are the components of a vector provided that they transform under rotation
of the coordinate axes about Oz in accordance with Equations (A.20)-(A.22) (and also
transform correctly under rotation about Ox and Oy). Conversely, $(a_x, a_y, a_z)$ cannot be
the components of a vector if they do not transform in accordance with Equations (A.20)-(A.22).
Of course, scalar quantities are invariant under rotation of the coordinate axes.
Thus, the individual components of a vector ($a_x$, say) are real numbers, but they are not
scalars. Displacement vectors, and all vectors derived from displacements (e.g., velocity, 
acceleration), automatically satisfy Equations (A.20)-(A.22). There are, however, other 
physical quantities which have both magnitude and direction, but which are not obviously 
related to displacements. We need to check carefully to see whether these quantities are 
really vectors (see Section A.8). 
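
The transformation rule of Equations (A.20)-(A.22), and the fact that it leaves the magnitude of a vector unchanged, can be checked numerically. The following sketch assumes that the components are stored as a tuple and that the angle is given in radians; `rotate_axes_z` is a hypothetical name chosen for this example.

```python
import math


def rotate_axes_z(a, theta):
    """Components of a in axes rotated through theta (radians) about Oz."""
    ax, ay, az = a
    c, s = math.cos(theta), math.sin(theta)
    return (ax * c + ay * s, -ax * s + ay * c, az)


a = (1.0, 2.0, 3.0)
a_new = rotate_axes_z(a, math.radians(30.0))
length_old = math.sqrt(sum(x * x for x in a))
length_new = math.sqrt(sum(x * x for x in a_new))
print(a_new)                      # the components change ...
print(length_old, length_new)     # ... but the magnitude does not (up to rounding)
```

The individual components differ between the two coordinate systems, which is why they are not scalars, whereas the magnitude agrees to within rounding error.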

A.6 Scalar Product

A scalar quantity is invariant under all possible rotational transformations. The individual 
components of a vector are not scalars because they change under transformation. Can 
we form a scalar out of some combination of the components of one, or more, vectors? 
Suppose that we were to define the "percent" product, 

$$\mathbf{a}\,\%\,\mathbf{b} = a_x\,b_z + a_y\,b_x + a_z\,b_y = \text{scalar number},$$ (A.23)

for general vectors a and b. Is a % b invariant under transformation, as must be the case
if it is a scalar number? Let us consider an example. Suppose that a = (0, 1, 0) and
b = (1, 0, 0). It is easily seen that a % b = 1. Let us now rotate the coordinate axes through 45°
about Oz. In the new coordinate system, $\mathbf{a} = (1/\sqrt{2}, 1/\sqrt{2}, 0)$ and $\mathbf{b} = (1/\sqrt{2}, -1/\sqrt{2}, 0)$,
giving a % b = 1/2. Clearly, a % b is not invariant under rotational transformation, so the
above definition is a bad one.
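
The counter-example above is easy to reproduce numerically. This sketch assumes the rotation rule of Equations (A.20)-(A.22) for the components; the function names are illustrative and do not come from the text.

```python
import math


def rotate_axes_z(a, theta):
    """Components of a in axes rotated through theta (radians) about Oz."""
    ax, ay, az = a
    c, s = math.cos(theta), math.sin(theta)
    return (ax * c + ay * s, -ax * s + ay * c, az)


def percent_product(a, b):
    """The trial 'percent' product of Equation (A.23)."""
    return a[0] * b[2] + a[1] * b[0] + a[2] * b[1]


a, b = (0.0, 1.0, 0.0), (1.0, 0.0, 0.0)
theta = math.radians(45.0)
before = percent_product(a, b)
after = percent_product(rotate_axes_z(a, theta), rotate_axes_z(b, theta))
print(before, after)    # the value changes under rotation, so it is not a scalar
```

The value drops from 1 to 1/2 (up to rounding) when the axes are rotated through 45°, as found above.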

Consider, now, the dot product or scalar product:

$$\mathbf{a} \cdot \mathbf{b} = a_x\,b_x + a_y\,b_y + a_z\,b_z = \text{scalar number}.$$ (A.24)

Let us rotate the coordinate axes through $\theta$ degrees about Oz. According to Equations (A.20)-(A.22),
a · b takes the form

$$\begin{aligned}
\mathbf{a} \cdot \mathbf{b} &= (a_x \cos\theta + a_y \sin\theta)\,(b_x \cos\theta + b_y \sin\theta) \\
&\quad + (-a_x \sin\theta + a_y \cos\theta)\,(-b_x \sin\theta + b_y \cos\theta) + a_z\,b_z \\
&= a_x\,b_x + a_y\,b_y + a_z\,b_z
\end{aligned}$$ (A.25)

in the new coordinate system. Thus, a · b is invariant under rotation about Oz. It can easily
be shown that it is also invariant under rotation about Ox and Oy. We conclude that a · b
is a true scalar, and that the above definition is a good one. Incidentally, a · b is the only
simple combination of the components of two vectors which transforms like a scalar. It is
easily shown that the dot product is commutative and distributive: i.e.,

$$\mathbf{a} \cdot \mathbf{b} = \mathbf{b} \cdot \mathbf{a},$$
$$\mathbf{a} \cdot (\mathbf{b} + \mathbf{c}) = \mathbf{a} \cdot \mathbf{b} + \mathbf{a} \cdot \mathbf{c}.$$ (A.26)
Figure A.6: A vector triangle.

The associative property is meaningless for the dot product, because we cannot have
$(\mathbf{a}\cdot\mathbf{b})\cdot\mathbf{c}$, since $\mathbf{a}\cdot\mathbf{b}$ is a scalar.
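
The invariance argument above can be checked numerically. The following Python sketch is only an illustration: the helper names (`rotate_axes_z`, `percent`, `dot`) are hypothetical, and the rotation is assumed to take the form implied by Equation (A.25), namely $a_{x'} = a_x\cos\theta + a_y\sin\theta$, $a_{y'} = -a_x\sin\theta + a_y\cos\theta$, $a_{z'} = a_z$.

```python
import math

def rotate_axes_z(v, theta):
    """Components of v after rotating the coordinate axes by theta about Oz."""
    x, y, z = v
    c, s = math.cos(theta), math.sin(theta)
    return (x * c + y * s, -x * s + y * c, z)

def percent(a, b):
    """The 'percent' product of Equation (A.23)."""
    return a[0] * b[2] + a[1] * b[0] + a[2] * b[1]

def dot(a, b):
    """The scalar product of Equation (A.24)."""
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

# The example vectors used in the text, and a 45 degree rotation about Oz.
a, b = (0.0, 1.0, 0.0), (1.0, 0.0, 0.0)
theta = math.radians(45.0)
a_new, b_new = rotate_axes_z(a, theta), rotate_axes_z(b, theta)

print(percent(a, b), percent(a_new, b_new))  # the percent product changes
print(dot(a, b), dot(a_new, b_new))          # the dot product does not (up to rounding)
```

The percent product changes from 1 to 1/2, as found above, whereas the dot product is the same in both coordinate systems.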

We have shown that the dot product $\mathbf{a}\cdot\mathbf{b}$ is coordinate independent. But what is the
geometric significance of this? Well, in the special case where $\mathbf{a} = \mathbf{b}$, we get

$$\mathbf{a}\cdot\mathbf{b} = a_x^{\,2} + a_y^{\,2} + a_z^{\,2} = |\mathbf{a}|^2 = a^2. \tag{A.27}$$

So, the invariance of $\mathbf{a}\cdot\mathbf{a}$ is equivalent to the invariance of the magnitude of vector $\mathbf{a}$ under
transformation.

Let us now investigate the general case. The length squared of $AB$ in the vector triangle
shown in Figure A.6 is

$$(\mathbf{b}-\mathbf{a})\cdot(\mathbf{b}-\mathbf{a}) = |\mathbf{a}|^2 + |\mathbf{b}|^2 - 2\,\mathbf{a}\cdot\mathbf{b}. \tag{A.28}$$

However, according to the "cosine rule" of trigonometry,

$$(AB)^2 = (OA)^2 + (OB)^2 - 2\,(OA)\,(OB)\cos\theta, \tag{A.29}$$

where $(AB)$ denotes the length of side $AB$. It follows that

$$\mathbf{a}\cdot\mathbf{b} = |\mathbf{a}|\,|\mathbf{b}|\cos\theta. \tag{A.30}$$

In this case, the invariance of $\mathbf{a}\cdot\mathbf{b}$ under transformation is equivalent to the invariance of
the angle subtended between the two vectors. Note that if $\mathbf{a}\cdot\mathbf{b} = 0$ then either $|\mathbf{a}| = 0$,
$|\mathbf{b}| = 0$, or the vectors $\mathbf{a}$ and $\mathbf{b}$ are mutually perpendicular. The angle subtended between
two vectors can easily be obtained from the dot product: i.e.,

$$\cos\theta = \frac{\mathbf{a}\cdot\mathbf{b}}{|\mathbf{a}|\,|\mathbf{b}|}. \tag{A.31}$$

The work $W$ performed by a constant force $\mathbf{F}$ which moves an object through a displacement
$\mathbf{r}$ is the product of the magnitude of $\mathbf{F}$ times the displacement in the direction
of $\mathbf{F}$. If the angle subtended between $\mathbf{F}$ and $\mathbf{r}$ is $\theta$ then

$$W = |\mathbf{F}|\,(|\mathbf{r}|\cos\theta) = \mathbf{F}\cdot\mathbf{r}. \tag{A.32}$$

The work $dW$ performed by a non-constant force $\mathbf{f}$ which moves an object through
an infinitesimal displacement $d\mathbf{r}$ in a time interval $dt$ is $dW = \mathbf{f}\cdot d\mathbf{r}$. Thus, the rate at
which the force does work on the object, which is usually referred to as the power, is
$P = dW/dt = \mathbf{f}\cdot d\mathbf{r}/dt$, or $P = \mathbf{f}\cdot\mathbf{v}$, where $\mathbf{v} = d\mathbf{r}/dt$ is the object's instantaneous velocity.
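
As a brief illustration of Equations (A.31) and (A.32), the Python sketch below computes the angle between a force and a displacement, and evaluates the work in both of the forms given above. The force and displacement values, and the helper names, are hypothetical and chosen only for the example.

```python
import math

def dot(a, b):
    """Scalar product of two 3-component vectors."""
    return sum(p * q for p, q in zip(a, b))

def norm(a):
    """Magnitude of a vector, |a| = sqrt(a . a)."""
    return math.sqrt(dot(a, a))

def angle_between(a, b):
    """Angle in radians from Equation (A.31); both vectors must be non-zero."""
    cos_theta = dot(a, b) / (norm(a) * norm(b))
    return math.acos(max(-1.0, min(1.0, cos_theta)))  # clamp against rounding

F = (3.0, 0.0, 0.0)   # hypothetical constant force
r = (2.0, 2.0, 0.0)   # hypothetical displacement

theta = angle_between(F, r)
W_dot = dot(F, r)                              # W = F . r
W_geom = norm(F) * norm(r) * math.cos(theta)   # W = |F| (|r| cos theta)
print(math.degrees(theta), W_dot, W_geom)
```

Both expressions for the work agree, because only the component of the displacement along $\mathbf{F}$ contributes.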
A.7 Vector Product

We have discovered how to construct a scalar from the components of two general vectors
$\mathbf{a}$ and $\mathbf{b}$. Can we also construct a vector which is not just a linear combination of $\mathbf{a}$ and $\mathbf{b}$?
Consider the following definition:

$$\mathbf{a} \ast \mathbf{b} = (a_x b_x,\; a_y b_y,\; a_z b_z). \tag{A.33}$$

Is $\mathbf{a}\ast\mathbf{b}$ a proper vector? Suppose that $\mathbf{a} = (0, 1, 0)$, $\mathbf{b} = (1, 0, 0)$. In this case, $\mathbf{a}\ast\mathbf{b} = \mathbf{0}$.
However, if we rotate the coordinate axes through 45° about $Oz$ then $\mathbf{a} = (1/\sqrt{2}, 1/\sqrt{2}, 0)$,
$\mathbf{b} = (1/\sqrt{2}, -1/\sqrt{2}, 0)$, and $\mathbf{a}\ast\mathbf{b} = (1/2, -1/2, 0)$. Thus, $\mathbf{a}\ast\mathbf{b}$ does not transform like a
vector, because its magnitude depends on the choice of axes. So, the above definition is a bad
one.

Consider, now, the cross product or vector product:

$$\mathbf{a}\times\mathbf{b} = (a_y b_z - a_z b_y,\; a_z b_x - a_x b_z,\; a_x b_y - a_y b_x) = \mathbf{c}. \tag{A.34}$$

Does this rather unlikely combination transform like a vector? Let us try rotating the
coordinate axes through an angle $\theta$ about $Oz$ using Equations (A.20)-(A.22). In the new
coordinate system,

$$\begin{aligned}
c_{x'} &= (-a_x\sin\theta + a_y\cos\theta)\,b_z - a_z\,(-b_x\sin\theta + b_y\cos\theta) \\
&= (a_y b_z - a_z b_y)\cos\theta + (a_z b_x - a_x b_z)\sin\theta \\
&= c_x\cos\theta + c_y\sin\theta.
\end{aligned} \tag{A.35}$$

Thus, the $x$-component of $\mathbf{a}\times\mathbf{b}$ transforms correctly. It can easily be shown that the other
components transform correctly as well, and that all components also transform correctly
under rotation about $Ox$ and $Oy$. Thus, $\mathbf{a}\times\mathbf{b}$ is a proper vector. Incidentally, $\mathbf{a}\times\mathbf{b}$ is the
only simple combination of the components of two vectors that transforms like a vector
(which is non-coplanar with $\mathbf{a}$ and $\mathbf{b}$). The cross product is anticommutative,

$$\mathbf{a}\times\mathbf{b} = -\mathbf{b}\times\mathbf{a}, \tag{A.36}$$

distributive,

$$\mathbf{a}\times(\mathbf{b}+\mathbf{c}) = \mathbf{a}\times\mathbf{b} + \mathbf{a}\times\mathbf{c}, \tag{A.37}$$

but is not associative,

$$\mathbf{a}\times(\mathbf{b}\times\mathbf{c}) \neq (\mathbf{a}\times\mathbf{b})\times\mathbf{c}. \tag{A.38}$$

The cross product transforms like a vector, which means that it must have a well-defined
direction and magnitude. We can show that $\mathbf{a}\times\mathbf{b}$ is perpendicular to both $\mathbf{a}$ and $\mathbf{b}$.
Consider $\mathbf{a}\cdot\mathbf{a}\times\mathbf{b}$. If this is zero then the cross product must be perpendicular to $\mathbf{a}$. Now,

$$\begin{aligned}
\mathbf{a}\cdot\mathbf{a}\times\mathbf{b} &= a_x\,(a_y b_z - a_z b_y) + a_y\,(a_z b_x - a_x b_z) + a_z\,(a_x b_y - a_y b_x) \\
&= 0.
\end{aligned} \tag{A.39}$$

Figure A.7: The right-hand rule for cross products. Here, $\theta$ is less than 180°.

Therefore, $\mathbf{a}\times\mathbf{b}$ is perpendicular to $\mathbf{a}$. Likewise, it can be demonstrated that $\mathbf{a}\times\mathbf{b}$ is
perpendicular to $\mathbf{b}$. The vectors $\mathbf{a}$, $\mathbf{b}$, and $\mathbf{a}\times\mathbf{b}$ form a right-handed set, like the unit
vectors $\mathbf{e}_x$, $\mathbf{e}_y$, and $\mathbf{e}_z$. In fact, $\mathbf{e}_x\times\mathbf{e}_y = \mathbf{e}_z$. This defines a unique direction for $\mathbf{a}\times\mathbf{b}$, which
is obtained from a right-hand rule (see Figure A.7).

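Let us now evaluate the magnitude of $\mathbf{a}\times\mathbf{b}$. We have

$$\begin{aligned}
(\mathbf{a}\times\mathbf{b})^2 &= (a_y b_z - a_z b_y)^2 + (a_z b_x - a_x b_z)^2 + (a_x b_y - a_y b_x)^2 \\
&= (a_x^{\,2} + a_y^{\,2} + a_z^{\,2})\,(b_x^{\,2} + b_y^{\,2} + b_z^{\,2}) - (a_x b_x + a_y b_y + a_z b_z)^2 \\
&= |\mathbf{a}|^2\,|\mathbf{b}|^2 - (\mathbf{a}\cdot\mathbf{b})^2 \\
&= |\mathbf{a}|^2\,|\mathbf{b}|^2 - |\mathbf{a}|^2\,|\mathbf{b}|^2\cos^2\theta = |\mathbf{a}|^2\,|\mathbf{b}|^2\sin^2\theta.
\end{aligned} \tag{A.40}$$

Thus,

$$|\mathbf{a}\times\mathbf{b}| = |\mathbf{a}|\,|\mathbf{b}|\sin\theta, \tag{A.41}$$

where $\theta$ is the angle subtended between $\mathbf{a}$ and $\mathbf{b}$. Clearly, $\mathbf{a}\times\mathbf{a} = \mathbf{0}$ for any vector, since $\theta$
is always zero in this case. Also, if $\mathbf{a}\times\mathbf{b} = \mathbf{0}$ then either $|\mathbf{a}| = 0$, $|\mathbf{b}| = 0$, or $\mathbf{b}$ is parallel (or
antiparallel) to $\mathbf{a}$.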
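
The properties derived above can be verified for a particular pair of vectors. The Python sketch below uses arbitrary, hypothetical vectors and helper names; it checks that the cross product of Equation (A.34) is perpendicular to both factors, is anticommutative, and has the magnitude given by Equation (A.41).

```python
import math

def dot(a, b):
    """Scalar product, Equation (A.24)."""
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def cross(a, b):
    """Vector product, Equation (A.34)."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def norm(a):
    """Magnitude of a vector."""
    return math.sqrt(dot(a, a))

a = (1.0, 2.0, 3.0)    # arbitrary example vectors
b = (-2.0, 0.5, 4.0)
c = cross(a, b)

# sin(theta) from the angle formula (A.31); theta lies between 0 and 180 degrees.
cos_theta = dot(a, b) / (norm(a) * norm(b))
sin_theta = math.sqrt(1.0 - cos_theta ** 2)

print(c, dot(a, c), dot(b, c))                    # perpendicular to a and to b
print(norm(c), norm(a) * norm(b) * sin_theta)     # Equation (A.41)
print(cross(b, a))                                # equals -c, Equation (A.36)
```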

Consider the parallelogram defined by the vectors $\mathbf{a}$ and $\mathbf{b}$ (see Figure A.8). The scalar
area of the parallelogram is $a\,b\sin\theta$. By convention, the vector area has the magnitude
of the scalar area, and is normal to the plane of the parallelogram, in the sense obtained
from a right-hand circulation rule by rotating $\mathbf{a}$ on to $\mathbf{b}$ (through an acute angle): i.e., if
the fingers of the right hand circulate in the direction of rotation then the thumb of the
right hand indicates the direction of the vector area. So, the vector area is coming out of
the page in Figure A.8. It follows that

$$\mathbf{S} = \mathbf{a}\times\mathbf{b}. \tag{A.42}$$

Suppose that a force $\mathbf{F}$ is applied at position $\mathbf{r}$ (see Figure A.9). The torque about
the origin $O$ is the product of the magnitude of the force and the length of the lever
arm $OQ$. Thus, the magnitude of the torque is $|\mathbf{F}|\,|\mathbf{r}|\sin\theta$. The direction of the torque is
conventionally defined as the direction of the axis through $O$ about which the force tries to
rotate objects, in the sense determined by a right-hand circulation rule. Hence, the torque
is out of the page in Figure A.9. It follows that the vector torque is given by

$$\boldsymbol{\tau} = \mathbf{r}\times\mathbf{F}. \tag{A.43}$$

The angular momentum, $\mathbf{l}$, of a particle of linear momentum $\mathbf{p}$ and position vector $\mathbf{r}$ is
simply defined as the moment of its momentum about the origin: i.e.,

$$\mathbf{l} = \mathbf{r}\times\mathbf{p}. \tag{A.44}$$



A.8 Rotation

Let us try to define a rotation vector $\boldsymbol{\theta}$ whose magnitude is the angle of the rotation, $\theta$, and
whose direction is parallel to the axis of rotation, in the sense determined by a right-hand
circulation rule. Unfortunately, this is not a good vector. The problem is that the addition
of rotations is not commutative, whereas vector addition is commutative. Figure A.10
shows the effect of applying two successive 90° rotations, one about $Ox$, and the other
about $Oz$, to a standard six-sided die. In the left-hand case, the $z$-rotation is applied
before the $x$-rotation, and vice versa in the right-hand case. It can be seen that the die
ends up in two completely different states. In other words, the $z$-rotation plus the $x$-rotation
does not equal the $x$-rotation plus the $z$-rotation. This non-commuting algebra
cannot be represented by vectors. So, although rotations have a well-defined magnitude
and direction, they are not vector quantities.
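
The failure of finite rotations to commute is easily demonstrated numerically. The Python sketch below is an illustration under stated assumptions: instead of a die, it rotates a single test vector, and the helper names (`rotate_x`, `rotate_z`, `rounded`) are hypothetical. Rotations are taken in the right-handed sense about the named axis.

```python
import math

def rotate_x(v, angle):
    """Rotate vector v by angle (radians) about Ox, in the right-handed sense."""
    x, y, z = v
    c, s = math.cos(angle), math.sin(angle)
    return (x, y * c - z * s, y * s + z * c)

def rotate_z(v, angle):
    """Rotate vector v by angle (radians) about Oz, in the right-handed sense."""
    x, y, z = v
    c, s = math.cos(angle), math.sin(angle)
    return (x * c - y * s, x * s + y * c, z)

def rounded(v):
    """Round away floating-point noise so that results are easy to compare."""
    return tuple(round(comp, 12) + 0.0 for comp in v)

quarter_turn = math.pi / 2
v = (1.0, 0.0, 0.0)   # test vector, initially along Ox

z_then_x = rounded(rotate_x(rotate_z(v, quarter_turn), quarter_turn))
x_then_z = rounded(rotate_z(rotate_x(v, quarter_turn), quarter_turn))
print(z_then_x, x_then_z)   # the two orderings give different final vectors
```

Applying the $z$-rotation first leaves the test vector along $Oz$, whereas applying the $x$-rotation first leaves it along $Oy$.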

But, this is not quite the end of the story. Suppose that we take a general vector $\mathbf{a}$ and
rotate it about $Oz$ by a small angle $\delta\theta_z$. This is equivalent to rotating the coordinate axes
about $Oz$ by $-\delta\theta_z$. According to Equations (A.20)-(A.22), we have

$$\mathbf{a}' \simeq \mathbf{a} + \delta\theta_z\,\mathbf{e}_z\times\mathbf{a}, \tag{A.45}$$

where use has been made of the small angle approximations $\sin\theta \simeq \theta$ and $\cos\theta \simeq 1$. The
above equation can easily be generalized to allow small rotations about $Ox$ and $Oy$ by $\delta\theta_x$
and $\delta\theta_y$, respectively. We find that

$$\mathbf{a}' \simeq \mathbf{a} + \delta\boldsymbol{\theta}\times\mathbf{a}, \tag{A.46}$$



where

$$\delta\boldsymbol{\theta} = \delta\theta_x\,\mathbf{e}_x + \delta\theta_y\,\mathbf{e}_y + \delta\theta_z\,\mathbf{e}_z. \tag{A.47}$$

Figure A.8: A vector parallelogram.

Figure A.9: A torque.

Figure A.10: Effect of successive rotations about perpendicular axes on a six-sided die.

Clearly, we can define a rotation vector, $\delta\boldsymbol{\theta}$, but it only works for small angle rotations (i.e.,
sufficiently small that the small angle approximations of sine and cosine are good). According
to the above equation, a small $z$-rotation plus a small $x$-rotation is (approximately)
equal to the two rotations applied in the opposite order. The fact that infinitesimal rotation
is a vector implies that angular velocity,

$$\boldsymbol{\omega} = \lim_{\delta t\rightarrow 0}\frac{\delta\boldsymbol{\theta}}{\delta t}, \tag{A.48}$$

must be a vector as well. Also, if $\mathbf{a}'$ is interpreted as $\mathbf{a}(t+\delta t)$ in Equation (A.46) then it
follows that the equation of motion of a vector which precesses about the origin with some
angular velocity $\boldsymbol{\omega}$ is

$$\frac{d\mathbf{a}}{dt} = \boldsymbol{\omega}\times\mathbf{a}. \tag{A.49}$$



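A.9 Scalar Triple Product

Consider three vectors $\mathbf{a}$, $\mathbf{b}$, and $\mathbf{c}$. The scalar triple product is defined as $\mathbf{a}\cdot\mathbf{b}\times\mathbf{c}$. Now, $\mathbf{b}\times\mathbf{c}$
is the vector area of the parallelogram defined by $\mathbf{b}$ and $\mathbf{c}$. So, $\mathbf{a}\cdot\mathbf{b}\times\mathbf{c}$ is the scalar area
of this parallelogram multiplied by the component of $\mathbf{a}$ in the direction of its normal. It
follows that $\mathbf{a}\cdot\mathbf{b}\times\mathbf{c}$ is the volume of the parallelepiped defined by vectors $\mathbf{a}$, $\mathbf{b}$, and $\mathbf{c}$ (see
Figure A.11). This volume is independent of how the triple product is formed from $\mathbf{a}$, $\mathbf{b}$,
and $\mathbf{c}$, except that

$$\mathbf{a}\cdot\mathbf{b}\times\mathbf{c} = -\mathbf{a}\cdot\mathbf{c}\times\mathbf{b}. \tag{A.50}$$

So, the "volume" is positive if $\mathbf{a}$, $\mathbf{b}$, and $\mathbf{c}$ form a right-handed set (i.e., if $\mathbf{a}$ lies above the
plane of $\mathbf{b}$ and $\mathbf{c}$, in the sense determined from a right-hand circulation rule by rotating $\mathbf{b}$
onto $\mathbf{c}$) and negative if they form a left-handed set. The triple product is unchanged if the
dot and cross product operators are interchanged,

$$\mathbf{a}\cdot\mathbf{b}\times\mathbf{c} = \mathbf{a}\times\mathbf{b}\cdot\mathbf{c}. \tag{A.51}$$

The triple product is also invariant under any cyclic permutation of $\mathbf{a}$, $\mathbf{b}$, and $\mathbf{c}$,

$$\mathbf{a}\cdot\mathbf{b}\times\mathbf{c} = \mathbf{b}\cdot\mathbf{c}\times\mathbf{a} = \mathbf{c}\cdot\mathbf{a}\times\mathbf{b}, \tag{A.52}$$

but any anti-cyclic permutation causes it to change sign,

$$\mathbf{a}\cdot\mathbf{b}\times\mathbf{c} = -\mathbf{b}\cdot\mathbf{a}\times\mathbf{c}. \tag{A.53}$$

The scalar triple product is zero if any two of $\mathbf{a}$, $\mathbf{b}$, and $\mathbf{c}$ are parallel, or if $\mathbf{a}$, $\mathbf{b}$, and $\mathbf{c}$ are
coplanar.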
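
A short numerical check of these properties is sketched below in Python. The three vectors and the helper names are hypothetical; the vectors are chosen so that the parallelepiped volume is easy to confirm by hand.

```python
def dot(a, b):
    """Scalar product, Equation (A.24)."""
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def cross(a, b):
    """Vector product, Equation (A.34)."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def triple(a, b, c):
    """Scalar triple product a . (b x c)."""
    return dot(a, cross(b, c))

a = (2.0, 0.0, 0.0)
b = (1.0, 3.0, 0.0)
c = (1.0, 1.0, 4.0)

print(triple(a, b, c))                      # signed volume of the parallelepiped
print(triple(b, c, a), triple(c, a, b))     # cyclic permutations, Equation (A.52)
print(triple(a, c, b), triple(b, a, c))     # sign changes, Equations (A.50) and (A.53)
print(dot(cross(a, b), c))                  # dot and cross interchanged, Equation (A.51)
print(triple(a, b, (3.0, 3.0, 0.0)))        # coplanar vectors give zero
```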
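Figure A.11: A vector parallelepiped.

If $\mathbf{a}$, $\mathbf{b}$, and $\mathbf{c}$ are non-coplanar then any vector $\mathbf{r}$ can be written in terms of them: i.e.,

$$\mathbf{r} = \alpha\,\mathbf{a} + \beta\,\mathbf{b} + \gamma\,\mathbf{c}. \tag{A.54}$$

Forming the dot product of this equation with $\mathbf{b}\times\mathbf{c}$, we then obtain

$$\mathbf{r}\cdot\mathbf{b}\times\mathbf{c} = \alpha\,\mathbf{a}\cdot\mathbf{b}\times\mathbf{c}, \tag{A.55}$$

so

$$\alpha = \frac{\mathbf{r}\cdot\mathbf{b}\times\mathbf{c}}{\mathbf{a}\cdot\mathbf{b}\times\mathbf{c}}. \tag{A.56}$$

Analogous expressions can be written for $\beta$ and $\gamma$. The parameters $\alpha$, $\beta$, and $\gamma$ are uniquely
determined provided $\mathbf{a}\cdot\mathbf{b}\times\mathbf{c} \neq 0$: i.e., provided that the three vectors are non-coplanar.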



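A.10 Vector Triple Product

For three vectors $\mathbf{a}$, $\mathbf{b}$, and $\mathbf{c}$, the vector triple product is defined as $\mathbf{a}\times(\mathbf{b}\times\mathbf{c})$. The brackets
are important because $\mathbf{a}\times(\mathbf{b}\times\mathbf{c}) \neq (\mathbf{a}\times\mathbf{b})\times\mathbf{c}$. In fact, it can be demonstrated that

$$\mathbf{a}\times(\mathbf{b}\times\mathbf{c}) \equiv (\mathbf{a}\cdot\mathbf{c})\,\mathbf{b} - (\mathbf{a}\cdot\mathbf{b})\,\mathbf{c} \tag{A.57}$$

and

$$(\mathbf{a}\times\mathbf{b})\times\mathbf{c} \equiv (\mathbf{a}\cdot\mathbf{c})\,\mathbf{b} - (\mathbf{b}\cdot\mathbf{c})\,\mathbf{a}. \tag{A.58}$$

Let us try to prove the first of the above theorems. The left-hand side and the right-hand
side are both proper vectors, so if we can prove this result in one particular coordinate
system then it must be true in general. Let us take convenient axes such that $Ox$ lies
along $\mathbf{b}$, and $\mathbf{c}$ lies in the $x$-$y$ plane. It follows that $\mathbf{b} = (b_x, 0, 0)$, $\mathbf{c} = (c_x, c_y, 0)$, and
$\mathbf{a} = (a_x, a_y, a_z)$. The vector $\mathbf{b}\times\mathbf{c}$ is directed along $Oz$: i.e., $\mathbf{b}\times\mathbf{c} = (0, 0, b_x c_y)$. Hence,
$\mathbf{a}\times(\mathbf{b}\times\mathbf{c})$ lies in the $x$-$y$ plane: i.e., $\mathbf{a}\times(\mathbf{b}\times\mathbf{c}) = (a_y b_x c_y, -a_x b_x c_y, 0)$. This is the
left-hand side of Equation (A.57) in our convenient coordinate system. To evaluate the
right-hand side, we need $\mathbf{a}\cdot\mathbf{c} = a_x c_x + a_y c_y$ and $\mathbf{a}\cdot\mathbf{b} = a_x b_x$. It follows that the
right-hand side is

$$\begin{aligned}
\text{RHS} &= \left([a_x c_x + a_y c_y]\,b_x,\; 0,\; 0\right) - \left(a_x b_x c_x,\; a_x b_x c_y,\; 0\right) \\
&= \left(a_y c_y b_x,\; -a_x b_x c_y,\; 0\right) = \text{LHS},
\end{aligned} \tag{A.59}$$

which proves the theorem.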
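
Both identities can also be spot-checked for a particular choice of vectors, as in the Python sketch below. The vectors are arbitrary and the helper names are hypothetical; a single numerical case is, of course, a consistency check rather than a proof.

```python
def dot(a, b):
    """Scalar product, Equation (A.24)."""
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def cross(a, b):
    """Vector product, Equation (A.34)."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def combine(s, u, t, v):
    """Return the vector s*u - t*v for scalars s, t and vectors u, v."""
    return tuple(s * ui - t * vi for ui, vi in zip(u, v))

a = (1.0, 2.0, 3.0)
b = (-1.0, 0.0, 2.0)
c = (4.0, 1.0, -2.0)

lhs_57 = cross(a, cross(b, c))
rhs_57 = combine(dot(a, c), b, dot(a, b), c)   # (a.c) b - (a.b) c
lhs_58 = cross(cross(a, b), c)
rhs_58 = combine(dot(a, c), b, dot(b, c), a)   # (a.c) b - (b.c) a

print(lhs_57, rhs_57)
print(lhs_58, rhs_58)   # differs from lhs_57, so the brackets matter
```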



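A.11 Vector Calculus

Suppose that vector $\mathbf{a}$ varies with time, so that $\mathbf{a} = \mathbf{a}(t)$. The time derivative of the vector
is defined

$$\frac{d\mathbf{a}}{dt} = \lim_{\delta t\rightarrow 0}\left[\frac{\mathbf{a}(t+\delta t) - \mathbf{a}(t)}{\delta t}\right]. \tag{A.60}$$

When written out in component form this becomes

$$\frac{d\mathbf{a}}{dt} = \left(\frac{da_x}{dt},\; \frac{da_y}{dt},\; \frac{da_z}{dt}\right). \tag{A.61}$$

Suppose that $\mathbf{a}$ is, in fact, the product of a scalar $\phi(t)$ and another vector $\mathbf{b}(t)$. What
now is the time derivative of $\mathbf{a}$? We have

$$\frac{da_x}{dt} = \frac{d}{dt}\left(\phi\,b_x\right) = \frac{d\phi}{dt}\,b_x + \phi\,\frac{db_x}{dt}, \tag{A.62}$$

which implies that

$$\frac{d\mathbf{a}}{dt} = \frac{d\phi}{dt}\,\mathbf{b} + \phi\,\frac{d\mathbf{b}}{dt}. \tag{A.63}$$

Moreover, it is easily demonstrated that

$$\frac{d}{dt}\left(\mathbf{a}\cdot\mathbf{b}\right) = \frac{d\mathbf{a}}{dt}\cdot\mathbf{b} + \mathbf{a}\cdot\frac{d\mathbf{b}}{dt}, \tag{A.64}$$

and

$$\frac{d}{dt}\left(\mathbf{a}\times\mathbf{b}\right) = \frac{d\mathbf{a}}{dt}\times\mathbf{b} + \mathbf{a}\times\frac{d\mathbf{b}}{dt}. \tag{A.65}$$

Hence, it can be seen that the laws of vector differentiation are analogous to those in
conventional calculus.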



A.12 Line Integrals

Consider a two-dimensional function $f(x, y)$ which is defined for all $x$ and $y$. What is meant
by the integral of $f$ along a given curve joining the points $P$ and $Q$ in the $x$-$y$ plane? Well,
we first draw out $f$ as a function of length $l$ along the path (see Figure A.12). The integral
is then simply given by

$$\int_P^Q f(x, y)\,dl = \text{Area under the curve}, \tag{A.66}$$

where $dl = \sqrt{dx^2 + dy^2}$.

Figure A.12: A line integral.

Figure A.13: An example line integral. (Labeled point: P = (0, 0).)



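As an example of this, consider the integral of $f(x, y) = x\,y^2$ between $P$ and $Q$ along
the two routes indicated in Figure A.13. Along route 1 we have $x = y$, so $dl = \sqrt{2}\,dx$.
Thus,

$$\int_P^Q x\,y^2\,dl = \int_0^1 x^3\,\sqrt{2}\,dx = \frac{\sqrt{2}}{4}. \tag{A.67}$$

The integration along route 2 gives

$$\int_P^Q x\,y^2\,dl = \left.\int_0^1 x\,y^2\,dx\,\right|_{y=0} + \left.\int_0^1 x\,y^2\,dy\,\right|_{x=1} = 0 + \frac{1}{3} = \frac{1}{3}. \tag{A.68}$$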



Note that the integral depends on the route taken between the initial and final points. 
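
The route dependence can be confirmed by direct numerical integration. The Python sketch below approximates $\int f\,dl$ with a midpoint rule along a parametrized path. It assumes that $P = (0, 0)$ and $Q = (1, 1)$, as the integration limits imply, and that route 2 runs along $y = 0$ and then along $x = 1$; the function and variable names are hypothetical.

```python
import math

def line_integral_dl(f, path, n=20000):
    """Midpoint-rule estimate of the integral of f(x, y) dl along path(t), 0 <= t <= 1."""
    total = 0.0
    x0, y0 = path(0.0)
    for i in range(1, n + 1):
        x1, y1 = path(i / n)
        dl = math.hypot(x1 - x0, y1 - y0)            # dl = sqrt(dx^2 + dy^2)
        total += f(0.5 * (x0 + x1), 0.5 * (y0 + y1)) * dl
        x0, y0 = x1, y1
    return total

def f(x, y):
    """The integrand of the example, f = x y^2."""
    return x * y ** 2

def route1(t):
    """Straight line x = y from (0, 0) to (1, 1)."""
    return (t, t)

def route2(t):
    """Along y = 0 to (1, 0), then along x = 1 to (1, 1)."""
    return (2.0 * t, 0.0) if t <= 0.5 else (1.0, 2.0 * t - 1.0)

I1 = line_integral_dl(f, route1)
I2 = line_integral_dl(f, route2)
print(I1, math.sqrt(2.0) / 4.0)   # route 1
print(I2, 1.0 / 3.0)              # route 2
```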

The most common type of line integral is that in which the contributions from $dx$ and
$dy$ are evaluated separately, rather than through the path length $dl$: i.e.,

$$\int_P^Q \left[f(x, y)\,dx + g(x, y)\,dy\right]. \tag{A.69}$$

Figure A.14: An example line integral. (Labeled points: P = (1, 0) and (2, 1).)



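As an example of this, consider the integral

$$\int_P^Q \left[y\,dx + x^3\,dy\right] \tag{A.70}$$

along the two routes indicated in Figure A.14. Along route 1 we have $x = y + 1$ and
$dx = dy$, so

$$\int_P^Q \left[y\,dx + x^3\,dy\right] = \int_0^1 \left[y\,dy + (y+1)^3\,dy\right] = \frac{1}{2} + \frac{15}{4} = \frac{17}{4}. \tag{A.71}$$

Along route 2,

$$\int_P^Q \left[y\,dx + x^3\,dy\right] = \left.\int_0^1 x^3\,dy\,\right|_{x=1} + \left.\int_1^2 y\,dx\,\right|_{y=1} = 1 + 1 = 2. \tag{A.72}$$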



Again, the integral depends on the path of integration. 

Suppose that we have a line integral which does not depend on the path of integration.
It follows that

$$\int_P^Q (f\,dx + g\,dy) = F(Q) - F(P) \tag{A.73}$$

for some function $F$. Given $F(P)$ for one point $P$ in the $x$-$y$ plane, then

$$F(Q) = F(P) + \int_P^Q (f\,dx + g\,dy) \tag{A.74}$$

defines $F(Q)$ for all other points in the plane. We can then draw a contour map of $F(x, y)$.
The line integral between points $P$ and $Q$ is simply the change in height in the contour
map between these two points:

$$\int_P^Q (f\,dx + g\,dy) = \int_P^Q dF(x, y) = F(Q) - F(P). \tag{A.75}$$

Thus,

$$dF(x, y) = f(x, y)\,dx + g(x, y)\,dy. \tag{A.76}$$
For instance, if $F = x^3\,y$ then $dF = 3\,x^2\,y\,dx + x^3\,dy$ and

$$\int_P^Q \left(3\,x^2\,y\,dx + x^3\,dy\right) = \left[x^3\,y\right]_P^Q \tag{A.77}$$



is independent of the path of integration. 
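
This path independence can be illustrated numerically. The Python sketch below integrates $3\,x^2\,y\,dx + x^3\,dy$ along two different routes using a midpoint rule, and compares the results with $F(Q) - F(P)$. The end points $P = (1, 0)$ and $Q = (2, 1)$, the two routes, and all function names are assumptions made only for the example.

```python
def line_integral(f, g, path, n=20000):
    """Midpoint-rule estimate of the integral of f dx + g dy along path(t), 0 <= t <= 1."""
    total = 0.0
    x0, y0 = path(0.0)
    for i in range(1, n + 1):
        x1, y1 = path(i / n)
        xm, ym = 0.5 * (x0 + x1), 0.5 * (y0 + y1)
        total += f(xm, ym) * (x1 - x0) + g(xm, ym) * (y1 - y0)
        x0, y0 = x1, y1
    return total

def f(x, y):
    return 3.0 * x ** 2 * y      # coefficient of dx

def g(x, y):
    return x ** 3                # coefficient of dy

def F(x, y):
    return x ** 3 * y            # the function whose differential is f dx + g dy

def straight(t):
    """Straight line from P = (1, 0) to Q = (2, 1)."""
    return (1.0 + t, t)

def dogleg(t):
    """Along x = 1 to (1, 1), then along y = 1 to (2, 1)."""
    return (1.0, 2.0 * t) if t <= 0.5 else (2.0 * t, 1.0)

I_straight = line_integral(f, g, straight)
I_dogleg = line_integral(f, g, dogleg)
print(I_straight, I_dogleg, F(2.0, 1.0) - F(1.0, 0.0))
```

Both routes give the same value, equal to the change in $F$ between the end points.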

It is clear that there are two distinct types of line integral: those which depend only on
their endpoints and not on the path of integration, and those which depend both on their
endpoints and the integration path. Later on, we shall learn how to distinguish between
these two types (see Section A.15).



A.13 Vector Line Integrals

A vector field is defined as a set of vectors associated with each point in space. For instance,
the velocity $\mathbf{v}(\mathbf{r})$ in a moving liquid (e.g., a whirlpool) constitutes a vector field. By analogy,
a scalar field is a set of scalars associated with each point in space. An example of a scalar
field is the temperature distribution $T(\mathbf{r})$ in a furnace.

Consider a general vector field $\mathbf{A}(\mathbf{r})$. Let $d\mathbf{r} = (dx, dy, dz)$ be the vector element of
line length. Vector line integrals often arise as

$$\int_P^Q \mathbf{A}\cdot d\mathbf{r} = \int_P^Q (A_x\,dx + A_y\,dy + A_z\,dz). \tag{A.78}$$



For instance, if A is a force-field then the line integral is the work done in going from P to 
Q. 

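As an example, consider the work done by an attractive inverse-square central field,
$\mathbf{F} = -\mathbf{r}/|\mathbf{r}|^3$. The element of work done is $dW = \mathbf{F}\cdot d\mathbf{r}$. Take $P = (\infty, 0, 0)$ and $Q = (a, 0, 0)$.
Route 1 is along the $x$-axis, so

$$W = \int_\infty^a \left(-\frac{1}{x^2}\right) dx = \left[\frac{1}{x}\right]_\infty^a = \frac{1}{a}. \tag{A.79}$$

The second route is, firstly, around a large circle ($r$ = constant) to the point $(a, \infty, 0)$, and
then parallel to the $y$-axis (see Figure A.15). In the first part, no work is done, since $\mathbf{F}$ is
perpendicular to $d\mathbf{r}$. In the second part,

$$W = \int_\infty^0 \frac{-y\,dy}{(a^2 + y^2)^{3/2}} = \left[\frac{1}{(y^2 + a^2)^{1/2}}\right]_\infty^0 = \frac{1}{a}. \tag{A.80}$$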



In this case, the integral is independent of the path. However, not all vector line integrals 
are path independent. 
Figure A.15: An example vector line integral.

A.14 Volume Integrals

A volume integral takes the form

$$\int\!\!\int\!\!\int_V f(x, y, z)\,dV, \tag{A.81}$$

where $V$ is some volume, and $dV = dx\,dy\,dz$ is a small volume element. The volume
element is sometimes written $d^3r$, or even $dr$.

As an example of a volume integral, let us evaluate the center of gravity of a solid pyramid.
Suppose that the pyramid has a square base of side $a$, a height $a$, and is composed of
material of uniform density. Let the centroid of the base lie at the origin, and let the apex
lie at $(0, 0, a)$. By symmetry, the center of mass lies on the line joining the centroid to the
apex. In fact, the height of the center of mass is given by

$$\bar{z} = \left.\int\!\!\int\!\!\int z\,dV \right/ \int\!\!\int\!\!\int dV. \tag{A.82}$$



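The bottom integral is just the volume of the pyramid, and can be written

$$\begin{aligned}
\int\!\!\int\!\!\int dV &= \int_0^a dz \int_{-(a-z)/2}^{(a-z)/2} dy \int_{-(a-z)/2}^{(a-z)/2} dx = \int_0^a (a-z)^2\,dz = \int_0^a (a^2 - 2\,a\,z + z^2)\,dz \\
&= \left[a^2\,z - a\,z^2 + \frac{z^3}{3}\right]_0^a = \frac{1}{3}\,a^3.
\end{aligned} \tag{A.83}$$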



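Here, we have evaluated the $z$-integral last because the limits of the $x$- and $y$-integrals are
$z$-dependent. The top integral takes the form

$$\begin{aligned}
\int\!\!\int\!\!\int z\,dV &= \int_0^a z\,dz \int_{-(a-z)/2}^{(a-z)/2} dy \int_{-(a-z)/2}^{(a-z)/2} dx = \int_0^a z\,(a-z)^2\,dz = \int_0^a (z\,a^2 - 2\,a\,z^2 + z^3)\,dz \\
&= \left[\frac{a^2\,z^2}{2} - \frac{2\,a\,z^3}{3} + \frac{z^4}{4}\right]_0^a = \frac{1}{12}\,a^4.
\end{aligned} \tag{A.84}$$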
Figure A.16: A two-dimensional gradient. (Labels: contours of h(x, y); direction of steepest ascent.)



Thus,

$$\bar{z} = \left.\frac{1}{12}\,a^4 \right/ \frac{1}{3}\,a^3 = \frac{1}{4}\,a. \tag{A.85}$$



In other words, the center of mass of a pyramid lies one quarter of the way between the 
centroid of the base and the apex. 
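
The result $\bar{z} = a/4$ is simple to confirm numerically. In the Python sketch below, the $x$- and $y$-integrals are carried out analytically, giving the cross-sectional area $(a-z)^2$ at height $z$, and the remaining $z$-integrals in Equations (A.83) and (A.84) are approximated by a midpoint rule. The function name and the choice $a = 2$ are hypothetical.

```python
def pyramid_centroid_height(a, n=100000):
    """Midpoint-rule estimates of z-bar and the volume for a square-based pyramid
    of base side a and height a, with uniform density."""
    dz = a / n
    volume = 0.0
    moment = 0.0
    for i in range(n):
        z = (i + 0.5) * dz
        area = (a - z) ** 2       # horizontal cross-section at height z
        volume += area * dz       # contribution to the integral of dV
        moment += z * area * dz   # contribution to the integral of z dV
    return moment / volume, volume

a = 2.0
z_bar, volume = pyramid_centroid_height(a)
print(z_bar, a / 4.0)          # center of mass height
print(volume, a ** 3 / 3.0)    # volume of the pyramid
```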

A.15 Gradient

A one-dimensional function f (x) has a gradient df/dx which is defined as the slope of the 
tangent to the curve at x. We wish to extend this idea to cover scalar fields in two and 
three dimensions. 

Consider a two-dimensional scalar field $h(x, y)$, which is (say) height above sea-level
in a hilly region. Let $d\mathbf{r} = (dx, dy)$ be an element of horizontal distance. Consider $dh/dr$,
where $dh$ is the change in height after moving an infinitesimal distance $d\mathbf{r}$. This quantity
is somewhat like the one-dimensional gradient, except that $dh$ depends on the direction of
$d\mathbf{r}$, as well as its magnitude. In the immediate vicinity of some point $P$, the slope reduces
to an inclined plane (see Figure A.16). The largest value of $dh/dr$ is straight up the slope.
It is easily shown that for any other direction

$$\frac{dh}{dr} = \left(\frac{dh}{dr}\right)_{\max}\cos\theta, \tag{A.86}$$

where $\theta$ is the angle shown in Figure A.16. Let us define a two-dimensional vector, $\text{grad}\,h$,
called the gradient of $h$, whose magnitude is $(dh/dr)_{\max}$, and whose direction is the direction
of steepest ascent. The $\cos\theta$ variation exhibited in the above expression ensures that
the component of $\text{grad}\,h$ in any direction is equal to $dh/dr$ for that direction.

The component of $dh/dr$ in the $x$-direction can be obtained by plotting out the profile
of $h$ at constant $y$, and then finding the slope of the tangent to the curve at given $x$. This
quantity is known as the partial derivative of $h$ with respect to $x$ at constant $y$, and is
denoted $(\partial h/\partial x)_y$. Likewise, the gradient of the profile at constant $x$ is written $(\partial h/\partial y)_x$.
Note that the subscripts denoting constant-$x$ and constant-$y$ are usually omitted, unless
there is any ambiguity. It follows that in component form

$$\text{grad}\,h = \left(\frac{\partial h}{\partial x},\; \frac{\partial h}{\partial y}\right). \tag{A.87}$$

Now, the equation of the tangent plane at $P = (x_0, y_0)$ is

$$h_T(x, y) = h(x_0, y_0) + \alpha\,(x - x_0) + \beta\,(y - y_0). \tag{A.88}$$

This has the same local gradients as $h(x, y)$, so

$$\alpha = \frac{\partial h}{\partial x}, \qquad \beta = \frac{\partial h}{\partial y}, \tag{A.89}$$

by differentiation of the above. For small $dx = x - x_0$ and $dy = y - y_0$, the function $h$ is
coincident with the tangent plane, so

$$dh = \frac{\partial h}{\partial x}\,dx + \frac{\partial h}{\partial y}\,dy. \tag{A.90}$$

But, $\text{grad}\,h = (\partial h/\partial x, \partial h/\partial y)$ and $d\mathbf{r} = (dx, dy)$, so

$$dh = \text{grad}\,h\cdot d\mathbf{r}. \tag{A.91}$$

Incidentally, the above equation demonstrates that $\text{grad}\,h$ is a proper vector, since the
left-hand side is a scalar, and, according to the properties of the dot product, the right-hand
side is also a scalar provided that $d\mathbf{r}$ and $\text{grad}\,h$ are both proper vectors ($d\mathbf{r}$ is an obvious
vector, because it is directly derived from displacements).
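
Equation (A.91) can be illustrated with a finite-difference estimate of the gradient. In the Python sketch below, the height field `h`, the evaluation point, and the small displacement are all hypothetical choices made for the example; the partial derivatives are approximated by central differences.

```python
import math

def h(x, y):
    """Hypothetical height field, used only for illustration."""
    return x ** 2 * y + math.sin(y)

def grad(f, x, y, eps=1e-6):
    """Central-difference estimate of (df/dx, df/dy) at the point (x, y)."""
    df_dx = (f(x + eps, y) - f(x - eps, y)) / (2.0 * eps)   # y held constant
    df_dy = (f(x, y + eps) - f(x, y - eps)) / (2.0 * eps)   # x held constant
    return df_dx, df_dy

x0, y0 = 1.0, 2.0
gx, gy = grad(h, x0, y0)

dx, dy = 1e-4, -2e-4                          # a small displacement dr
dh_exact = h(x0 + dx, y0 + dy) - h(x0, y0)    # actual change in height
dh_linear = gx * dx + gy * dy                 # dh = grad h . dr, Equation (A.91)
print((gx, gy), dh_exact, dh_linear)
```

The two estimates of $dh$ differ only by terms of second order in the displacement, which is the sense in which $h$ coincides with its tangent plane for small $dx$ and $dy$.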

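Consider, now, a three-dimensional temperature distribution $T(x, y, z)$ in (say) a reaction
vessel. Let us define $\text{grad}\,T$, as before, as a vector whose magnitude is $(dT/dr)_{\max}$,
and whose direction is the direction of the maximum gradient. This vector is written in
component form

$$\text{grad}\,T = \left(\frac{\partial T}{\partial x},\; \frac{\partial T}{\partial y},\; \frac{\partial T}{\partial z}\right). \tag{A.92}$$

Here, $\partial T/\partial x \equiv (\partial T/\partial x)_{y,z}$ is the gradient of the one-dimensional temperature profile at
constant $y$ and $z$. The change in $T$ in going from point $P$ to a neighbouring point offset by
$d\mathbf{r} = (dx, dy, dz)$ is

$$dT = \frac{\partial T}{\partial x}\,dx + \frac{\partial T}{\partial y}\,dy + \frac{\partial T}{\partial z}\,dz. \tag{A.93}$$

In vector form, this becomes

$$dT = \text{grad}\,T\cdot d\mathbf{r}. \tag{A.94}$$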
Figure A.17: Isotherms.



Suppose that $dT = 0$ for some $d\mathbf{r}$. It follows that

$$dT = \text{grad}\,T\cdot d\mathbf{r} = 0. \tag{A.95}$$

So, $d\mathbf{r}$ is perpendicular to $\text{grad}\,T$. Since $dT = 0$ along so-called "isotherms" (i.e., contours
of the temperature), we conclude that the isotherms (contours) are everywhere perpendicular
to $\text{grad}\,T$ (see Figure A.17).

It is, of course, possible to integrate $dT$. For instance, the line integral of $dT$ between
points $P$ and $Q$ is written

$$\int_P^Q dT = \int_P^Q \text{grad}\,T\cdot d\mathbf{r} = T(Q) - T(P). \tag{A.96}$$

This integral is clearly independent of the path taken between $P$ and $Q$, so $\int_P^Q \text{grad}\,T\cdot d\mathbf{r}$
must be path independent.

Consider a vector field $\mathbf{A}(\mathbf{r})$. In general, the line integral $\int_P^Q \mathbf{A}\cdot d\mathbf{r}$ depends on the
path taken between the end points, but for some special vector fields the integral is path
independent. Such fields are called conservative fields. It can be shown that if $\mathbf{A}$ is a conservative
field then $\mathbf{A} = \text{grad}\,V$ for some scalar field $V$. The proof of this is straightforward.
Keeping $P$ fixed, we have

$$\int_P^Q \mathbf{A}\cdot d\mathbf{r} = V(Q), \tag{A.97}$$

where $V(Q)$ is a well-defined function, due to the path independent nature of the line
integral. Consider moving the position of the end point by an infinitesimal amount $dx$ in
the $x$-direction. We have

$$V(Q + dx) = V(Q) + \int_Q^{Q+dx} \mathbf{A}\cdot d\mathbf{r} = V(Q) + A_x\,dx. \tag{A.98}$$

Hence,

$$\frac{\partial V}{\partial x} = A_x, \tag{A.99}$$

with analogous relations for the other components of $\mathbf{A}$. It follows that

$$\mathbf{A} = \text{grad}\,V. \tag{A.100}$$

In Newtonian dynamics, the force due to gravity is a good example of a conservative
field. Now, if $\mathbf{A}(\mathbf{r})$ is a force-field then $\int \mathbf{A}\cdot d\mathbf{r}$ is the work done in traversing some path. If
$\mathbf{A}$ is conservative then

$$\oint \mathbf{A}\cdot d\mathbf{r} = 0, \tag{A.101}$$

where $\oint$ corresponds to the line integral around a closed loop. The fact that zero net work
is done in going around a closed loop is equivalent to the conservation of energy (which is
why conservative fields are called "conservative"). A good example of a non-conservative
field is the force due to friction. Clearly, a frictional system loses energy in going around a
closed cycle, so $\oint \mathbf{A}\cdot d\mathbf{r} \neq 0$.
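
The distinction between the two kinds of field can be seen by evaluating the loop integral numerically. The Python sketch below integrates $\mathbf{A}\cdot d\mathbf{r}$ anticlockwise around a square using a midpoint rule. Both example fields are hypothetical: the first is the gradient of $V = x^3\,y$, and the second, $(-y, x)$, is a simple swirling field which is not the gradient of any scalar.

```python
def loop_integral(field, a=1.0, n=2000):
    """Midpoint-rule estimate of the closed-loop integral of A . dr around the square
    with corners (0,0), (a,0), (a,a), (0,a), traversed anticlockwise."""
    corners = [(0.0, 0.0), (a, 0.0), (a, a), (0.0, a), (0.0, 0.0)]
    total = 0.0
    for (xs, ys), (xe, ye) in zip(corners, corners[1:]):
        dx, dy = (xe - xs) / n, (ye - ys) / n        # step along this side
        for i in range(n):
            xm, ym = xs + (i + 0.5) * dx, ys + (i + 0.5) * dy
            ax, ay = field(xm, ym)
            total += ax * dx + ay * dy               # A . dr for this step
    return total

def conservative(x, y):
    """A = grad V with V = x^3 y."""
    return (3.0 * x ** 2 * y, x ** 3)

def swirl(x, y):
    """A = (-y, x), which is not a gradient field."""
    return (-y, x)

loop_conservative = loop_integral(conservative)
loop_swirl = loop_integral(swirl)
print(loop_conservative)   # approximately zero, as in Equation (A.101)
print(loop_swirl)          # non-zero
```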



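A.16 Grad Operator

It is useful to define the vector operator

$$\nabla \equiv \left(\frac{\partial}{\partial x},\; \frac{\partial}{\partial y},\; \frac{\partial}{\partial z}\right), \tag{A.102}$$

which is usually called the grad or del operator. This operator acts on everything to its
right in an expression, until the end of the expression or a closing bracket is reached. For
instance,

$$\text{grad}\,f = \nabla f \equiv \left(\frac{\partial f}{\partial x},\; \frac{\partial f}{\partial y},\; \frac{\partial f}{\partial z}\right). \tag{A.103}$$

For two scalar fields $\phi$ and $\psi$,

$$\text{grad}\,(\phi\,\psi) = \phi\,\text{grad}\,\psi + \psi\,\text{grad}\,\phi \tag{A.104}$$

can be written more succinctly as

$$\nabla(\phi\,\psi) = \phi\,\nabla\psi + \psi\,\nabla\phi. \tag{A.105}$$

Suppose that we rotate the coordinate axes through an angle $\theta$ about $Oz$. By analogy
with Equations (A.17)-(A.19), the old coordinates $(x, y, z)$ are related to the new ones
$(x', y', z')$ via

$$x = x'\cos\theta - y'\sin\theta, \tag{A.106}$$
$$y = x'\sin\theta + y'\cos\theta, \tag{A.107}$$
$$z = z'. \tag{A.108}$$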
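Figure A.18: Cylindrical polar coordinates.

Now,

$$\frac{\partial}{\partial x'} = \left(\frac{\partial x}{\partial x'}\right)_{y',z'}\frac{\partial}{\partial x} + \left(\frac{\partial y}{\partial x'}\right)_{y',z'}\frac{\partial}{\partial y} + \left(\frac{\partial z}{\partial x'}\right)_{y',z'}\frac{\partial}{\partial z}, \tag{A.109}$$

giving

$$\frac{\partial}{\partial x'} = \cos\theta\,\frac{\partial}{\partial x} + \sin\theta\,\frac{\partial}{\partial y}, \tag{A.110}$$

and

$$\nabla_{x'} = \cos\theta\,\nabla_x + \sin\theta\,\nabla_y. \tag{A.111}$$

It can be seen, from Equations (A.20)-(A.22), that the differential operator $\nabla$ transforms
in an analogous manner to a vector. This is another proof that $\nabla f$ is a good vector.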



A.17 Curvilinear Coordinates

In the cylindrical coordinate system, the Cartesian coordinates $x$ and $y$ are replaced by
$r = \sqrt{x^2 + y^2}$ and $\theta = \tan^{-1}(y/x)$. Here, $r$ is the perpendicular distance from the $z$-axis,
and $\theta$ the angle subtended between the perpendicular radius vector and the $x$-axis (see
Figure A.18). A general vector $\mathbf{A}$ is thus written

$$\mathbf{A} = A_r\,\mathbf{e}_r + A_\theta\,\mathbf{e}_\theta + A_z\,\mathbf{e}_z, \tag{A.112}$$

where $\mathbf{e}_r = \nabla r/|\nabla r|$ and $\mathbf{e}_\theta = \nabla\theta/|\nabla\theta|$ (see Figure A.18). Note that the unit vectors $\mathbf{e}_r$,
$\mathbf{e}_\theta$, and $\mathbf{e}_z$ are mutually orthogonal. Hence, $A_r = \mathbf{A}\cdot\mathbf{e}_r$, etc. The volume element in this
coordinate system is $d^3r = r\,dr\,d\theta\,dz$. Moreover, the gradient of a general scalar field $V(\mathbf{r})$
takes the form

$$\nabla V = \frac{\partial V}{\partial r}\,\mathbf{e}_r + \frac{1}{r}\,\frac{\partial V}{\partial\theta}\,\mathbf{e}_\theta + \frac{\partial V}{\partial z}\,\mathbf{e}_z. \tag{A.113}$$
Figure A.19: Spherical polar coordinates.

In the spherical coordinate system, the Cartesian coordinates $x$, $y$, and $z$ are replaced
by $r = \sqrt{x^2 + y^2 + z^2}$, $\theta = \cos^{-1}(z/r)$, and $\phi = \tan^{-1}(y/x)$. Here, $r$ is the radial distance
from the origin, $\theta$ the angle subtended between the radius vector and the $z$-axis, and $\phi$
the angle subtended between the projection of the radius vector onto the $x$-$y$ plane and
the $x$-axis (see Figure A.19). Note that $r$ and $\theta$ in the spherical system are not the same as
their counterparts in the cylindrical system. A general vector $\mathbf{A}$ is written

$$\mathbf{A} = A_r\,\mathbf{e}_r + A_\theta\,\mathbf{e}_\theta + A_\phi\,\mathbf{e}_\phi, \tag{A.114}$$

where $\mathbf{e}_r = \nabla r/|\nabla r|$, $\mathbf{e}_\theta = \nabla\theta/|\nabla\theta|$, and $\mathbf{e}_\phi = \nabla\phi/|\nabla\phi|$. The unit vectors $\mathbf{e}_r$, $\mathbf{e}_\theta$, and $\mathbf{e}_\phi$
are mutually orthogonal. Hence, $A_r = \mathbf{A}\cdot\mathbf{e}_r$, etc. The volume element in this coordinate
system is $d^3r = r^2\sin\theta\,dr\,d\theta\,d\phi$. Moreover, the gradient of a general scalar field $V(\mathbf{r})$
takes the form

$$\nabla V = \frac{\partial V}{\partial r}\,\mathbf{e}_r + \frac{1}{r}\,\frac{\partial V}{\partial\theta}\,\mathbf{e}_\theta + \frac{1}{r\sin\theta}\,\frac{\partial V}{\partial\phi}\,\mathbf{e}_\phi. \tag{A.115}$$
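
As a consistency check on the cylindrical form of the gradient, Equation (A.113), the Python sketch below evaluates the gradient of an illustrative field in both coordinate systems and compares the results. The field $V = x\,y + z^2$, the sample point, and the function names are hypothetical. The angle is computed with `math.atan2`, which agrees with $\tan^{-1}(y/x)$ up to the choice of quadrant, and the unit vectors are resolved as $\mathbf{e}_r = (\cos\theta, \sin\theta, 0)$ and $\mathbf{e}_\theta = (-\sin\theta, \cos\theta, 0)$.

```python
import math

def grad_cartesian(x, y, z):
    """Gradient of the illustrative field V = x*y + z**2 in Cartesian components."""
    return (y, x, 2.0 * z)

def grad_cylindrical(r, theta, z):
    """The same field, V = r**2 sin(theta) cos(theta) + z**2, via Equation (A.113).
    Returns the components along e_r, e_theta, and e_z."""
    dV_dr = 2.0 * r * math.sin(theta) * math.cos(theta)
    dV_dtheta = r ** 2 * math.cos(2.0 * theta)
    return (dV_dr, dV_dtheta / r, 2.0 * z)

x, y, z = 1.5, -0.7, 0.3                       # arbitrary sample point
r, theta = math.hypot(x, y), math.atan2(y, x)  # cylindrical coordinates of the point

g_r, g_theta, g_z = grad_cylindrical(r, theta, z)

# Resolve the cylindrical components along the Cartesian axes.
gx = g_r * math.cos(theta) - g_theta * math.sin(theta)
gy = g_r * math.sin(theta) + g_theta * math.cos(theta)

print((gx, gy, g_z), grad_cartesian(x, y, z))
```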



A.18 Exercises

A.1. The position vectors of the four points $A$, $B$, $C$, and $D$ are $\mathbf{a}$, $\mathbf{b}$, $3\,\mathbf{a} + 2\,\mathbf{b}$, and $\mathbf{a} - 3\,\mathbf{b}$,
respectively. Express $\overrightarrow{AC}$, $\overrightarrow{DB}$, $\overrightarrow{BC}$, and $\overrightarrow{CD}$ in terms of $\mathbf{a}$ and $\mathbf{b}$.

A.2. Prove the trigonometric law of sines

$$\frac{\sin a}{A} = \frac{\sin b}{B} = \frac{\sin c}{C}$$

using vector methods. Here, $a$, $b$, and $c$ are the three angles of a plane triangle, and $A$, $B$,
and $C$ the lengths of the corresponding opposite sides.
A.3. Demonstrate using vectors that the diagonals of a parallelogram bisect one another. In addition,
show that if the diagonals of a quadrilateral bisect one another then it is a parallelogram.

A.4. From the inequality

$$\mathbf{a}\cdot\mathbf{b} = |\mathbf{a}|\,|\mathbf{b}|\cos\theta \leq |\mathbf{a}|\,|\mathbf{b}|$$

deduce the triangle inequality

$$|\mathbf{a} + \mathbf{b}| \leq |\mathbf{a}| + |\mathbf{b}|.$$

A.5. Find the scalar product $\mathbf{a}\cdot\mathbf{b}$ and the vector product $\mathbf{a}\times\mathbf{b}$ when

(a) $\mathbf{a} = \mathbf{e}_x + 3\,\mathbf{e}_y - \mathbf{e}_z$, $\mathbf{b} = 3\,\mathbf{e}_x + 2\,\mathbf{e}_y + \mathbf{e}_z$,

(b) $\mathbf{a} = \mathbf{e}_x - 2\,\mathbf{e}_y + \mathbf{e}_z$, $\mathbf{b} = 2\,\mathbf{e}_x + \mathbf{e}_y + \mathbf{e}_z$.

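A.6. Which of the following statements regarding the three general vectors $\mathbf{a}$, $\mathbf{b}$, and $\mathbf{c}$ are true?

(a) $\mathbf{c}\cdot(\mathbf{a}\times\mathbf{b}) = (\mathbf{b}\times\mathbf{a})\cdot\mathbf{c}$.

(b) $\mathbf{a}\times(\mathbf{b}\times\mathbf{c}) = (\mathbf{a}\times\mathbf{b})\times\mathbf{c}$.

(c) $\mathbf{a}\times(\mathbf{b}\times\mathbf{c}) = (\mathbf{a}\cdot\mathbf{c})\,\mathbf{b} - (\mathbf{a}\cdot\mathbf{b})\,\mathbf{c}$.

(d) $\mathbf{d} = \lambda\,\mathbf{a} + \mu\,\mathbf{b}$ implies that $(\mathbf{a}\times\mathbf{b})\cdot\mathbf{d} = 0$.

(e) $\mathbf{a}\times\mathbf{c} = \mathbf{b}\times\mathbf{c}$ implies that $\mathbf{c}\cdot\mathbf{a} - \mathbf{c}\cdot\mathbf{b} = c\,|\mathbf{a} - \mathbf{b}|$.

(f) $(\mathbf{a}\times\mathbf{b})\times(\mathbf{c}\times\mathbf{b}) = [\mathbf{b}\cdot(\mathbf{c}\times\mathbf{a})]\,\mathbf{b}$.

A.7. Prove that the length of the shortest straight-line from point $\mathbf{a}$ to the straight-line joining
points $\mathbf{b}$ and $\mathbf{c}$ is

$$\frac{|\mathbf{a}\times\mathbf{b} + \mathbf{b}\times\mathbf{c} + \mathbf{c}\times\mathbf{a}|}{|\mathbf{b} - \mathbf{c}|}.$$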

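A.8. Identify the following surfaces:

(a) $|\mathbf{r}| = a$,

(b) $\mathbf{r}\cdot\mathbf{n} = b$,

(c) $\mathbf{r}\cdot\mathbf{n} = c\,|\mathbf{r}|$,

(d) $|\mathbf{r} - (\mathbf{r}\cdot\mathbf{n})\,\mathbf{n}| = d$.

Here, $\mathbf{r}$ is the position vector, $a$, $b$, $c$, and $d$ are positive constants, and $\mathbf{n}$ is a fixed unit vector.

A.9. Let $\mathbf{a}$, $\mathbf{b}$, and $\mathbf{c}$ be coplanar vectors related via

$$\alpha\,\mathbf{a} + \beta\,\mathbf{b} + \gamma\,\mathbf{c} = 0,$$

where $\alpha$, $\beta$, and $\gamma$ are not all zero. Show that the condition for the points with position
vectors $u\,\mathbf{a}$, $v\,\mathbf{b}$, and $w\,\mathbf{c}$ to be colinear is

$$\frac{\alpha}{u} + \frac{\beta}{v} + \frac{\gamma}{w} = 0.$$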
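A.10. If $\mathbf{p}$, $\mathbf{q}$, and $\mathbf{r}$ are any vectors, demonstrate that $\mathbf{a} = \mathbf{q} + \lambda\,\mathbf{r}$, $\mathbf{b} = \mathbf{r} + \mu\,\mathbf{p}$, and $\mathbf{c} = \mathbf{p} + \nu\,\mathbf{q}$ are
coplanar provided that $\lambda\,\mu\,\nu = -1$, where $\lambda$, $\mu$, and $\nu$ are scalars. Show that this condition is
satisfied when $\mathbf{a}$ is perpendicular to $\mathbf{p}$, $\mathbf{b}$ to $\mathbf{q}$, and $\mathbf{c}$ to $\mathbf{r}$.

A.11. The vectors $\mathbf{a}$, $\mathbf{b}$, and $\mathbf{c}$ are non-coplanar, and form a non-orthogonal vector base. The vectors
$\mathbf{A}$, $\mathbf{B}$, and $\mathbf{C}$, defined by

$$\mathbf{A} = \frac{\mathbf{b}\times\mathbf{c}}{\mathbf{a}\cdot\mathbf{b}\times\mathbf{c}},$$

plus cyclic permutations, are said to be reciprocal vectors. Show that

$$\mathbf{a} = (\mathbf{B}\times\mathbf{C})/(\mathbf{A}\cdot\mathbf{B}\times\mathbf{C}),$$

plus cyclic permutations.

A.12. In the notation of the previous question, demonstrate that the plane passing through points
$\mathbf{a}/\alpha$, $\mathbf{b}/\beta$, and $\mathbf{c}/\gamma$ is normal to the direction of the vector

$$\mathbf{h} = \alpha\,\mathbf{A} + \beta\,\mathbf{B} + \gamma\,\mathbf{C}.$$

In addition, show that the perpendicular distance of the plane from the origin is $|\mathbf{h}|^{-1}$.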
A.13. Evaluate $\oint \mathbf{A}\cdot d\mathbf{r}$ for

$x\,\mathbf{e}_x + y\,\mathbf{e}_y$



around the square whose sides are x = 0, x = a, y = 0, y = a. 
A.14. Find the gradients of the following scalar functions of the position vector $\mathbf{r} = (x, y, z)$:

(a) $\mathbf{k}\cdot\mathbf{r}$,

(b) |rr, 

(c) |r-kr, 

(d) $\cos(\mathbf{k}\cdot\mathbf{r})$.

Here, k is a fixed vector. 



